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Chapter 1 

Introduction to Statistics and Data 
Analysis 


1.1 (a) 15. 

(b) x = ^(3.4 + 2.5 + 4.8 + • • • + 4.8) = 3.787. 

(c) Sample median is the 8th value, after the data is sorted from smallest to largest: 
3.6. 

(d) A dot plot is shown below. 

• • 



(e) After trimming total 40% of the data (20% highest and 20% lowest), the data 
becomes: 

2.9 3.0 3.3 3.4 3.6 
3.7 4.0 4.4 4.8 

So. the trimmed mean is 

3h r 2o = tt( 2.9 "b ^'0 + ■ ■ ■ 4" 4.8) = 3.678. 

y 


1.2 (a) Mean=20.768 and Median=20.610. 

(b) x tr io = 20.743. 

(c) A dot plot is shown below. 

• • • m • • •••• • • m • • •• • 

I I I I I I 

18 19 20 21 22 23 
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1.3 (a) A dot plot is shown below. 

o 

O OOO O O OOxxxO xx x xxx 

200 205 210 215 220 225 230 

In the figure, “x” represents the “No aging” group and “o” represents the “Aging” 
group. 

(b) Yes; tensile strength is greatly reduced due to the aging process. 

(c) Mean A g in g = 209.90, and MeaiiNo aging = 222.10. 

(d) MediariAging = 210.00, and Median No aging = 221.50. The means and medians for 
each group are similar to each other. 

1.4 (a) = 7.950 and Xa = 8.250; 

X B = 10.260 and X B = 10.150. 

(b) A dot plot is shown below. 

xxxx x xxx xx OOOO OO OO 

6.5 7X5 8X5 9X5 105 Ti.5 

In the figure, “x” represents company A and “o” represents company B. The 
steel rods made by company B show more flexibility. 

1.5 (a) A dot plot is shown below. 


oqx o 

xO x xxox x x O x x O 

-10 0 10 20 30 40 

In the figure, “x” represents the control group and “o” represents the treatment 
group. 

(b) Yc on t r oi = 5.60, Ab on t ro i = 5.00, and Y^ r (io); Control = 5.13; 

ATreatment 7.60, A'jYeatment 4.50, and At r (lo);Treatment 5.63. 

(c) The difference of the means is 2.0 and the differences of the medians and the 
trimmed means are 0.5, which are much smaller. The possible cause of this might 
be due to the extreme values (outliers) in the samples, especially the value of 37. 

1.6 (a) A dot plot is shown below, 
o 

x o x o 

O Oxx x xox xxoxxx O OO OO 

1.95 Z05 245 225 2X35 ^45 Z55 

In the figure, “x” represents the 20°C group and “o” represents the 45°C group. 

(b) A 20 o C = 2.1075, and X 45 o C = 2.2350. 

(c) Based on the plot, it seems that high temperature yields more high values of 
tensile strength, along with a few low values of tensile strength. Overall, the 
temperature does have an influence on the tensile strength. 
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1.7 

1.8 
1.9 

1.10 

1.11 

1.12 

1.13 

1.14 

1.15 

1.16 


1.17 


(d) It also seems that the variation of the tensile strength gets larger when the cure 
temperature is increased. 

s 2 = _L_[(3.4_3.787) 2 + (2.5-3.787) 2 + (4.8-3.787) 2 + - • • + (4.8 — 3.787) 2 ] = 0.94284; 
s = Vs 2 = VO. 9428 = 0.971. 

s 2 = 2 q 3 i [(18.71 - 20.768) 2 + (21.41 - 20.768) 2 + • • • + (21.12 - 20.768) 2 ] = 2.5345; 
s = V2.5345 = 1.592. 

4o Aging = Io4[( 227 - 222. 10) 2 + (222 - 222.10) 2 + • • • + (221 - 222.10) 2 ] = 42.12; 
-SNo Aging = V 42.12 = 6.49. 

^ Aging = t4t [( 219 - 209.90) 2 + (214 - 209.90) 2 + • • • + (205 - 209.90) 2 ] = 23.62; 
s Aging = V23.62 = 4.86. 

For company A: s\ = 1.2078 and = Vl-2078 = 1.099. 

For company B: s 2 B = 0.3249 and sb = V0.3249 = 0.570. 

For the control group: Scontroi = 69.39 and s control = 8.33. 

For the treatment group: s^eatment = 128.14 and s T reatment = 11.32. 

For the cure temperature at 20°C: s|o°c = 0.005 and S2o°c = 0.071. 

For the cure temperature at 45°C: <sl 5 o C = 0.0413 and S450C = 0.2032. 

The variation of the tensile strength is influenced by the increase of cure temperature. 

(a) Mean = X = 124.3 and median = X = 120; 

(b) 175 is an extreme observation. 

(a) Mean = X = 570.5 and median = X = 571; 

(b) Variance = s 2 = 10; standard deviation^ s = 3.162; range=10; 

(c) Variation of the diameters seems too big. 

Yes. The value 0.03125 is actually a P-value and a small value of this quantity means 
that the outcome (i.e., HHHHH) is very unlikely to happen with a fair coin. 

The term on the left side can be manipulated to 

n n n 

Xj nx = Xj - x.i = 0 , 
i= 1 i— 1 i= 1 

which is the term on the right side. 

(a) V smokers 43.70 and Y nonsmokers 30.32, 

(b) ^smokers 16.93 and <s nonsmokers 7.13, 
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(c) A dot plot is shown below. 

o 

x xOO xxxxkxCXxxxxO <3D ODO O O 

10 20 30 40 50 60 70 

In the hgure, “x” represents the nonsmoker group and “o” represents the smoker 
group. 

(d) Smokers appear to take longer time to fall asleep and the time to fall asleep for 
smoker group is more variable. 

1.18 (a) A stem-and-leaf plot is shown below. 


Stem 

Leaf 

Frequency 

1 

057 

3 

2 

35 

2 

3 

246 

3 

4 

1138 

4 

5 

22457 

5 

6 

00123445779 

11 

7 

01244456678899 

14 

8 

00011223445589 

14 

9 

0258 

4 


(b) The following is the relative frequency distribution table. 

Relative Frequency Distribution of Grades 


Class Interval 

Class Midpoint 

Frequency, / 

Relative Frequency 

10-19 

14.5 

3 

0.05 

20-29 

24.5 

2 

0.03 

30-39 

34.5 

3 

0.05 

40-49 

44.5 

4 

0.07 

50-59 

54.5 

5 

0.08 

60-69 

64.5 

11 

0.18 

70-79 

74.5 

14 

0.23 

80-89 

84.5 

14 

0.23 

90-99 

94.5 

4 

0.07 


(c) A histogram plot is given below. 



14.5 24.5 34.5 44.5 54.5 64.5 74.5 84.5 94.5 

Final Exam Grades 
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The distribution skews to the left. 

(d) X = 65.48, X = 71.50 and s = 21.13. 

1.19 (a) A stem-and-leaf plot is shown below. 


Stem 

Leaf 

Frequency 

0 

22233457 

8 

1 

023558 

6 

2 

035 

3 

3 

03 

2 

4 

057 

3 

5 

0569 

4 

6 

0005 

4 


(b) The following is the relative frequency distribution table. 


Relative Frequency Distribution of Years 


Class Interval 

Class Midpoint 

Frequency, / 

Relative Frequency 

o 

0 

1 

o 

io 

0.45 

8 

0.267 

1.0 - 1.9 

1.45 

6 

0.200 

2.0 - 2.9 

2.45 

3 

0.100 

3.0 -3.9 

3.45 

2 

0.067 

4.0 -4.9 

4.45 

3 

0.100 

5.0 -5.9 

5.45 

4 

0.133 

6.0 -6.9 

6.45 

4 

0.133 


(c) X = 2.797, s = 2.227 and Sample range is 6.5 — 0.2 = 6.3. 
1.20 (a) A stem-and-leaf plot is shown next. 


Stem 

Leaf 

Frequency 

0* 

34 

2 

0 

56667777777889999 

17 

1* 

0000001223333344 

16 

1 

5566788899 

10 

2* 

034 

3 

2 

7 

1 

3* 

2 

1 


(b) The relative frequency distribution table is shown next. 
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Relative Frequency Distribution of Fruit Fly Lives 


Class Interval 

Class Midpoint 

Frequency, / 

Relative Frequency 

0-4 

2 

2 

0.04 

5-9 

7 

17 

0.34 

10 - 14 

12 

16 

0.32 

15-19 

17 

10 

0.20 

20-24 

22 

3 

0.06 

25-29 

27 

1 

0.02 

30-34 

32 

1 

0.02 


(c) A histogram plot is shown next. 



2 7 12 17 22 27 32 

Fruit fly lives (seconds) 


(d) X = 10.50. 

1.21 (a) X = 1.7743 and X = 1.7700; 

(b) s = 0.3905. 

1.22 (a) X = 6.7261 and X = 0.0536. 

(b) A histogram plot is shown next. 



6.62 6.66 6.7 6.74 6.78 6.82 

Relative Frequency Fiistogram for Diameter 

(c) The data appear to be skewed to the left. 


(a) A dot plot is shown next. 


160.15 


GBXXK 

100 


395.10 

®®0 x x 

400 500 


(b) X ] ggo — 395.1 and A^ggo — 160.2. 
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(c) The sample mean for 1980 is over twice as large as that of 1990. The variability 
for 1990 decreased also as seen by looking at the picture in (a). The gap represents 
an increase of over 400 ppm. It appears from the data that hydrocarbon emissions 
decreased considerably between 1980 and 1990 and that the extreme large emission 
(over 500 ppm) were no longer in evidence. 

1.24 (a) X = 2.8973 and s = 0.5415. 

(b) A histogram plot is shown next. 



1.8 2.1 2.4 2.7 3 3.3 3.6 3.9 

Salaries 


(c) Use the double-stem-and-leaf plot, we have the following. 


Stem 

Leaf 

Frequency 

1 

(84) 

1 

2* 

(05) (10) (14) (37) (44) (45) 

6 

2 

(52) (52) (67) (68) (71) (75) (77) (83) (89) (91) (99) 

11 

3* 

(10) (13) (14) (22) (36) (37) 

6 

3 

(51)(54)(57)(71)(79)(85) 

6 


1.25 (a) X = 33.31; 

(b) X = 26.35; 

(c) A histogram plot is shown next. 



10 20 30 40 50 60 70 80 90 

Percentage of the families 
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(d) A" tr (io) = 30.97. This trimmed mean is in the middle of the mean and median 
using the full amount of data. Due to the skewness of the data to the right (see 
plot in (c)), it is common to use trimmed data to have a more robust result. 

1.26 If a model using the function of percent of families to predict staff salaries, it is likely 
that the model would be wrong due to several extreme values of the data. Actually if 
a scatter plot of these two data sets is made, it is easy to see that some outlier would 
influence the trend. 

1.27 (a) The averages of the wear are plotted here. 



700 800 900 1000 1100 1200 1300 

load 


(b) When the load value increases, the wear value also increases. It does show certain 
relationship. 

(c) A plot of wears is shown next. 



700 800 900 1000 1100 1200 1300 

load 


(d) The relationship between load and wear in (c) is not as strong as the case in (a), 
especially for the load at 1300. One reason is that there is an extreme value (750) 
which influence the mean value at the load 1300. 

1.28 (a) A dot plot is shown next. 

High Low 

OCOOS O OGSO xx x 8 x x x 

71.45 71.65 71.85 72.05 72.25 72.45 72.65 72.85 

In the figure, “x” represents the low-injection- velocity group and “o” represents 
the high-injection-velocity group. 
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(b) It appears that shrinkage values for the low-injection-velocity group is higher than 
those for the high-injection-velocity group. Also, the variation of the shrinkage 
is a little larger for the low injection velocity than that for the high injection 
velocity. 


1.29 (a) A dot plot is shown next. 

Low 

>90K 

76 79 82 85 88 


High 

O ©BEX© 
91 94 


In the figure, “x” represents the low-injection- velocity group and “o” represents 
the high-injection-velocity group. 

(b) In this time, the shrinkage values are much higher for the high-injection- velocity 
group than those for the low-injection-velocity group. Also, the variation for the 
former group is much higher as well. 

(c) Since the shrinkage effects change in different direction between low mode tem- 
perature and high mold temperature, the apparent interactions between the mold 
temperature and injection velocity are significant. 


1.30 An interaction plot is shown next. 


mean shrinkage value 



+- 


Low 


high 


low mold temp 

-injection velocity 


It is quite obvious to find the interaction between the two variables. Since in this exper- 
imental data, those two variables can be controlled each at two levels, the interaction 
can be investigated. However, if the data are from an observational studies, in which 
the variable values cannot be controlled, it would be difficult to study the interactions 
among these variables. 




Chapter 2 


Probability 


2.1 (a) 5 = {8,16,24,32,40,48}. 

(b) For x 2 + Ax — 5 = (x + 5)(x — 1) = 0, the only solutions are x = — 5 and x — 1. 
5 = {-5,1}. 

(c) S = {T,HT,HHT,HHH}. 

(d) S = {N. America, S. America, Europe, Asia, Africa, Australia, Antarctica}. 

(e) Solving 2x — 4 > 0 gives x > 2. Since we must also have x < 1, it follows that 
S = <p. 


2.2 S = {(x, y) | x 2 + y 2 < 9; x > 0, y > 0}. 

2.3 (a) A = {1,3}. 

(b) B = { 1,2, 3, 4, 5, 6}. 

(c) C = {x | x 2 — Ax + 3 = 0} = {x | (x — l)(x — 3) = 0} = {1, 3}. 

(d) D = {0, 1, 2, 3, 4, 5, 6}. Clearly, A = C. 


2.4 (a) S = {(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6), 

(3. 1) , (3, 2), (3, 3), (3, 4), (3, 5), (3, 6), (4, 1), (4, 2), (4, 3), (4, 4), (4, 5), (4, 6), 

(5. 1) , (5, 2), (5, 3), (5, 4), (5, 5), (5, 6), (6, 1), (6, 2), (6, 3), (6, 4), (6, 5), (6, 6)}. 

(b) S = {(x,y) | 1 < x,y < 6}. 


2.5 S = {1 HH, 1HT, 1 TH, ITT , 2H, 2 T, 3 HH, 3 HT, 3 TH, 3 TT, AH, AT, 5 HH, 5 HT, 5 TH, 
5TT, 6H, 6T}. 

2.6 S = {A1A2, A1A3, A1A4, A2A3, A2A4, A3A4}. 


2.7 Si = {M M M M, MMMF, MMFM, MFMM, FMMM, MMFF, MFMF, MFFM, 
FMFM , FFMM, FMMF, MFFF, FMFF, FFMF, FFFM, FFFF}. 

S 2 = { 0 , 1 , 2 , 3 , 4 }. 

2.8 (a) A = {(3, 6), (4, 5), (4, 6), (5, 4), (5, 5), (5, 6), (6, 3), (6, 4), (6, 5), (6, 6)}. 
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(b) B = {( 1, 2), (2, 2), (3, 2), (4, 2), (5, 2), (6, 2), (2, 1), (2, 3), (2, 4), 
(2, 5), (2, 6)}. 


(d) Anc = {(5, 4), (5, 5), (5, 6), (6, 3), (6, 4), (6, 5), (6, 6)}. 

(e) A n B = </>. 

(f) BHC = {(5,2), (6,2)}. 

(g) A Venn diagram is shown next. 



2.9 (a) A = {1 HH, 1HT, 1TH, ITT, 2 H, 2 T}. 

(b) B = {ITT,3TT,5TT}. 

(c) A' = {3 HH, 3HT, 3TH, 3TT, AH, AT, 5 HH, 5 HT, 5 TH, 5 TT, 6 H, 6 T}. 

(d) A' A B = {3TT, 5TT}. 

(e) A U B = {1 HH, 1 HT, 1 TH, ITT, 2 H, 2T, 3 TT, 5 TT}. 

2.10 (a) S = {FFF, FFN, FNF, NFF, FNN, NFN, NNF, NNN}. 

(b) E = {FFF, FFN, FNF, NFF}. 

(c) The second river was safe for fishing. 

2.11 (a) S = {MiM 2 , M j F i , M \ F 2 , M 2 M\, M 2 F\ , A / I 2 F 2 , F\ M y , F\ M 2 , F\F 2 , F 2 Mi, F 2 M 2 , 

f 2 f 1 }. 

(b) A = { M i M 2 , M i F i , M i F 2 , M 2 M\ , M 2 Fi, M 2 F 2 }. 

(c) B = {M\F\, M i F 2 , M 2 F\, M 2 F 2 , F\M\, F\M 2 , F 2 M\ , F 2 M 2 }. 

(d) C = {F 1 F 2 ,F 2 F 1 }. 

(e) A n B = {M 1 F 1 , M x F 2 , M 2 F x , M 2 F 2 }. 

(f) A U C = [M\M 2 , AfiFx, M\F 2 , M 2 M x , M 2 F x , M 2 F 2 , F x F 2 , F-^}. 
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2.12 (a) S — {ZYF, ZNF, WYF, WNF , SYF, SNF, ZYM}. 

(b) A U B = {ZYF, ZNF, WYF, WNF, SYF , SNF} = A. 

(c) AC B = {WYF, SYF}. 

2.13 A Venn diagram is shown next. 



2.14 (a) A U C = {0, 2, 3, 4, 5, 6, 8}. 

(b) An £ = </>. 

(c) C' — {0, 1 , 6, 7, 8, 9}. 

(d) C'HD = {1, 6, 7}, so (C n D) U B = {1, 3, 5, 6, 7, 9}. 

(e) (SnC)' = C' = {0,1, 6, 7, 8, 9}. 

(f) A n c = {2, 4},so4ncnh' = { 2 , 4}. 

2.15 (a) A' — {nitrogen, potassium, uranium, oxygen}. 

(b) AUC = {copper, sodium, zinc, oxygen}. 

(c) An B' = {copper, zinc} and 

C = {copper, sodium, nitrogen, potassium, uranium, zinc}; 

so (A n B ') U C' = {copper, sodium, nitrogen, potassium, uranium, zinc}. 
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(d) B' n C' = {copper, uranium, zinc}. 

(e) ApBCC = <t). 

(f) A' U B' — {copper, nitrogen, potassium, uranium, oxygen, zinc} and 
d'nC= {oxygen}; so, {A' U B ') fl {A' n C) = {oxygen}. 

2.16 (a) MUN — {x\0<x< 9}. 

(b) M r N = {x \ 1 < x < 5}. 

(c) M' D N* = {x | 9 < x < 12}. 

2.17 A Venn diagram is shown next. 




/"C 

S 


t A 

1 \ B \ 


1 


\ 3 j 4 j 



(a) From the above Venn diagram, ( A fl B)' contains the regions of 1, 2 and 4. 

(b) (A U B)' contains region 1. 

(c) A Venn diagram is shown next. 



(A fl C) U B contains the regions of 3, 4, 5, 7 and 8. 

2.18 (a) Not mutually exclusive. 

(b) Mutually exclusive. 

(c) Not mutually exclusive. 

(d) Mutually exclusive. 

2.19 (a) The family will experience mechanical problems but will receive no ticket for 

traffic violation and will not arrive at a campsite that has no vacancies. 

(b) The family will receive a traffic ticket and arrive at a campsite that has no va- 
cancies but will not experience mechanical problems. 
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(c) The family will experience mechanical problems and will arrive at a campsite that 
has no vacancies. 

(d) The family will receive a traffic ticket but will not arrive at a campsite that has 
no vacancies. 

(e) The family will not experience mechanical problems. 

2.20 (a) 6; 

(b) 2; 

(c) 2, 5, 6; 

(d) 4, 5, 6, 8. 

2.21 With ni — 6 sightseeing tours each available on rt 2 = 3 different days, the multiplication 
rule gives njn 2 = (6) (3) = 18 ways for a person to arrange a tour. 

2.22 With rii = 8 blood types and n 2 = 3 classifications of blood pressure, the multiplication 
rule gives niri 2 = (8) (3) = 24 classifications. 

2.23 Since the die can land in n x = 6 ways and a letter can be selected in n 2 = 26 ways, the 
multiplication rule gives nin 2 = (6) (26) = 156 points in S. 

2.24 Since a student may be classified according to ri\ = 4 class standing and n 2 = 2 gender 
classifications, the multiplication rule gives nin 2 = (4) (2) = 8 possible classifications 
for the students. 

2.25 With rii = 5 different shoe styles in n 2 = 4 different colors, the multiplication rule 
gives riin 2 = (5) (4) = 20 different pairs of shoes. 

2.26 Using Theorem 2.8, we obtain the followings. 

(a) There are ( 5 ) =21 ways. 

(b) There are (:() = 10 ways. 

2.27 Using the generalized multiplication rule, there are rii xn 2 xn 3 xn 4 = (4) (3) (2) (2) = 48 
different house plans available. 

2.28 With ri\ = 5 different manufacturers, n 2 = 3 different preparations, and 72,3 = 2 
different strengths, the generalized multiplication rule yields nin 2 n 3 = (5) (3) (2) = 30 
different ways to prescribe a drug for asthma. 

2.29 With tii = 3 race cars, n 2 = 5 brands of gasoline, n 3 = 7 test sites, and n 4 = 2 drivers, 
the generalized multiplication rule yields (3) (5) (7) (2) = 210 test runs. 

2.30 With n\ = 2 choices for the first question, n 2 = 2 choices for the second question, 
and so forth, the generalized multiplication rule yields nin 2 • • - rig = 2 9 = 512 ways to 
answer the test. 
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2.31 (a) With ri\ = 4 possible answers for the first question, ri 2 = 4 possible answers 

for the second question, and so forth, the generalized multiplication rule yields 
4 5 = 1024 ways to answer the test. 

(b) With ri\ = 3 wrong answers for the first question, n-i = 3 wrong answers for the 
second question, and so forth, the generalized multiplication rule yields 

nrn 2 n 3 n 4 n 5 = (3) (3) (3) (3) (3) = 3 5 = 243 

ways to answer the test and get all questions wrong. 

2.32 (a) By Theorem 2.3, 7! = 5040. 

(b) Since the first letter must be m, the remaining 6 letters can be arranged in 6! = 720 
ways. 

2.33 Since the first digit is a 5, there are ri\ = 9 possibilities for the second digit and then 
n 2 = 8 possibilities for the third digit. Therefore, by the multiplication rule there are 
nin 2 = (9) (8) = 72 registrations to be checked. 

2.34 (a) By Theorem 2.3, there are 6! = 720 ways. 

(b) A certain 3 persons can follow each other in a line of 6 people in a specified order is 

4 ways or in (4) (3!) = 24 ways with regard to order. The other 3 persons can then 
be placed in line in 3! = 6 ways. By Theorem 2.1, there are total (24) (6) = 144 
ways to line up 6 people with a certain 3 following each other. 

(c) Similar as in (b), the number of ways that a specified 2 persons can follow each 
other in a line of 6 people is (5) (2!) (4!) = 240 ways. Therefore, there are 720 — 
240 = 480 ways if a certain 2 persons refuse to follow each other. 

2.35 The first house can be placed on any of the ri\ = 9 lots, the second house on any of the 
remaining n 2 = 8 lots, and so forth. Therefore, there are 9! = 362, 880 ways to place 
the 9 homes on the 9 lots. 

2.36 (a) Any of the 6 nonzero digits can be chosen for the hundreds position, and of the 

remaining 6 digits for the tens position, leaving 5 digits for the units position. So, 
there are (6) (5) (5) = 150 three digit numbers. 

(b) The units position can be filled using any of the 3 odd digits. Any of the remaining 

5 nonzero digits can be chosen for the hundreds position, leaving a choice of 5 
digits for the tens position. By Theorem 2.2, there are (3) (5) (5) = 75 three digit 
odd numbers. 

(c) If a 4, 5, or 6 is used in the hundreds position there remain 6 and 5 choices, 
respectively, for the tens and units positions. This gives (3) (6) (5) = 90 three 
digit numbers beginning with a 4, 5, or 6. If a 3 is used in the hundreds position, 
then a 4, 5, or 6 must be used in the tens position leaving 5 choices for the units 
position. In this case, there are (1)(3)(5) = 15 three digit number begin with 
a 3. So, the total number of three digit numbers that are greater than 330 is 
90 + 15 = 105. 
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2.37 The first seat must be filled by any of 5 girls and the second seat by any of 4 boys. 
Continuing in this manner, the total number of ways to seat the 5 girls and 4 boys is 
(5)(4)(4)(3)(3)(2)(2)(1)(1) — 2880. 

2.38 (a) 8! = 40320. 

(b) There are 4! ways to seat 4 couples and then each member of a couple can be 
interchanged resulting in 2 4 (4!) = 384 ways. 

(c) By Theorem 2.3, the members of each gender can be seated in 4! ways. Then 
using Theorem 2.1, both men and women can be seated in (4!) (4!) = 576 ways. 

2.39 (a) Any of the ri\ = 8 finalists may come in first, and of the n-z = 7 remaining finalists 

can then come in second, and so forth. By Theorem 2.3, there 8! = 40320 possible 
orders in which 8 finalists may finish the spelling bee. 

(b) The possible orders for the first three positions are 8-P3 = §} = 336. 

2.40 By Theorem 2.4, 8 P 5 = § = 6720. 

2.41 By Theorem 2.4, 6 P 4 = § = 360. 

2.42 By Theorem 2.4, 40 P 3 = f| = 59, 280. 

2.43 By Theorem 2.5, there are 4! = 24 ways. 

2.44 By Theorem 2.5, there are 7! = 5040 arrangements. 

2.45 By Theorem 2.6, there are 3^ = 3360. 

2.46 By Theorem 2.6, there are gjlj^ = 1260 ways. 

2.47 By Theorem 2.7, there are ( 7 3 2 9 ) = 7920 ways. 

2-48 (J,) + ( 2 ,y + (JJ + Oh) + ( 2 ,y = 4410. 

2.49 By Theorem 2.8, there are (3) = 56 ways. 

2.50 Assume February 29th as March 1st for the leap year. There are total 365 days in a 
year. The number of ways that all these 60 students will have different birth dates (i.e, 
arranging 60 from 365) is 365-P60- This is a very large number. 

2.51 (a) Sum of the probabilities exceeds 1. 

(b) Sum of the probabilities is less than 1. 

(c) A negative probability. 

(d) Probability of both a heart and a black card is zero. 


2.52 Assuming equal weights 
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(a) P(A) = 

(b) P(C) = 

(c) P(AnC) = |. 

2.53 S = {$10, $25, $100} with weights 275/500 = 11/20, 150/500 = 3/10, and 75/500 = 
3/20, respectively. The probability that the first envelope purchased contains less than 
$100 is equal to 11/20 + 3/10 = 17/20. 

2.54 (a) P(S n D') = 88/500 = 22/125. 

(b) P(EPDnS') = 31/500. 

(c) P(S'nE') = 171/500. 

2.55 Consider the events 

S: industry will locate in Shanghai, 

B : industry will locate in Beijing. 

(a) P(S n B) = P(S) + P(B) - P(S U B) = 0.7 + 0.4 - 0.8 = 0.3. 

(b) P(S' n B’) = 1 - P(S u B) - 1 - 0.8 - 0.2. 

2.56 Consider the events 

B\ customer invests in tax-free bonds, 

M : customer invests in mutual funds. 

(a) P(B U M) — P(B ) + P(M) - P(B Cl M) — 0.6 + 0.3 - 0.15 = 0.75. 

(b) P(B' n M') — 1 — P(B U M) = 1 - 0.75 = 0.25. 

2.57 (a) Since 5 of the 26 letters are vowels, we get a probability of 5/26. 

(b) Since 9 of the 26 letters precede j, we get a probability of 9/26. 

(c) Since 19 of the 26 letters follow g, we get a probability of 19/26. 

2.58 (a) Let A = Defect in brake system; B = Defect in fuel system; P(A U B) = P(A) + 

P(B) - P(A n B) = 0.25 + 0.17 - 0.15 = 0.27. 

(b) P(No defect) = 1 - P(A U B) = 1 - 0.27 = 0.73. 

2.59 By Theorem 2.2, there are N = (26)(25)(24)(9)(8)(7)(6) = 47,174,400 possible ways 
to code the items of which n = (5) (25) (24) (8) (7) (6) (4) = 4, 032, 000 begin with a vowel 
and end with an even digit. Therefore, -y = 

2.60 (a) Of the (6) (6) = 36 elements in the sample space, only 5 elements (2,6), (3,5), 

(4,4), (5,3), and (6,2) add to 8. Lienee the probability of obtaining a total of 8 is 
then 5/36. 

(b) Ten of the 36 elements total at most 5. Lienee the probability of obtaining a total 
of at most is 10/36=5/18. 
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2.61 Since there are 20 cards greater than 2 and less than 8, the probability of selecting two 
of these in succession is 

/20\ / 19\ _ 95 
\52 / \51 / _ 663' 


2.62 

(a) 

G)G) 

(S) 

1 

3 ■ 


(b) 

©(?) 

(S) 

_ 5 

14' 

2.63 

(a) 

«)( 4 2 8 ) 

(f) 

94 

54145 ' 


(b) 

(?)(?: 

(?) 

) _ 143 

39984 


2.64 Any four of a kind, say four 2’s and one 5 occur in ('^ = 5 ways each with probability 
(l/6)(l/6)(l/6)(l/6)(l/6) = (1/6) 5 . Since there are 6 -P 2 = 30 ways to choose various 
pairs of numbers to constitute four of one kind and one of the other (we use permutation 
instead of combination is because that four 2’s and one 5, and four 5’s and one 2 are 
two different ways), the probability is (5) (30) (1/6) 5 = 25/1296. 

2.65 (a) P(M U H) = 88/100 = 22/25; 

(b) P(M' n H') = 12/100 = 3/25; 

(c) P(H n M') = 34/100 = 17/50. 

2.66 (a) 9; 

(b) 1/9. 

2.67 (a) 0.32; 

(b) 0.68; 

(c) office or den. 

2.68 (a) 1 - 0.42 = 0.58; 

(b) 1 - 0.04 = 0.96. 

2.69 P(A) = 0.2 and P(B) = 0.35 

(a) P(A') = 1 - 0.2 = 0.8; 

(b) P(A' n B’) = 1 - P(A U B) = 1 - 0.2 - 0.35 = 0.45; 

(c) P(A U B) = 0.2 + 0.35 = 0.55. 

2.70 (a) 0.02 + 0.30 = 0.32 = 32%; 

(b) 0.32 + 0.25 + 0.30 = 0.87 = 87%; 
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(c) 0.05 + 0.06 + 0.02 = 0.13 = 13%; 

(d) 1 - 0.05 - 0.32 = 0.63 = 63%. 
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2.71 (a) 0.12 + 0.19 = 0.31; 

(b) 1 - 0.07 = 0.93; 

(c) 0.12 + 0.19 = 0.31. 

2.72 (a) 1 - 0.40 = 0.60. 

(b) The probability that all six purchasing the electric oven or all six purchasing the 
gas oven is 0.007 + 0.104 = 0.111. So the probability that at least one of each 
type is purchased is 1 — 0.111 = 0.889. 

2.73 (a) P(C ) = 1 - P(A) - P{B) = 1 - 0.990 - 0.001 = 0.009; 

(b) P(B') = 1 - P(B) = 1 - 0.001 = 0.999; 

(c) P(B) + P(C) = 0.01. 

2.74 (a) ($4.50 - $4.00) x 50, 000 = $25, 000; 

(b) Since the probability of underfilling is 0.001, we would expect 50, 000 x 0.001 = 50 
boxes to be underfilled. So, instead of having ($4.50 — $4.00) x 50 = $25 profit 
for those 50 boxes, there are a loss of $4.00 x 50 = $200 due to the cost. So, the 
loss in profit expected due to underfilling is $25 + $200 = $250. 

2.75 (a) 1 - 0.95 - 0.002 = 0.048; 

(b) ($25.00 - $20.00) x 10, 000 = $50, 000; 

(c) (0.05)(10, 000) x $5.00 + (0.05)(10, 000) x $20 = $12, 500. 

2.76 P(A'nB') = 1-P(AUB) = l-(P(A)+P(B)-P(Ar\B) = l+P(Ar\B)-P(A)-P(B). 

2.77 (a) The probability that a convict who pushed dope, also committed armed robbery. 

(b) The probability that a convict who committed armed robbery, did not push dope. 

(c) The probability that a convict who did not push dope also did not commit armed 
robbery. 

2.78 P(S | A) = 10/18 = 5/9. 

2.79 Consider the events: 

M : a person is a male; 

S: a person has a secondary education; 

C : a person has a college degree. 


(a) P(M | S) = 28/78 = 14/39; 

(b) P(C | M') = 95/112. 



Solutions for Exercises in Chapter 2 


21 


2.80 Consider the events: 

A: a person is experiencing hypertension, 
B: a person is a heavy smoker, 

C: a person is a nonsmoker. 


(a) P(A | B) = 30/49; 

(b) P(C | A') = 48/93 = 16/31. 

2.81 (a) P(Mn?nff) = | = |; 
(b) P(HC\M | P') = 

2.82 (a) (0.90)(0.08) = 0.072; 

(b) (0.90)(0.92)(0.12) = 0.099. 


22—10 12 3 

100-68 32 8' 


2.83 


(a) 

(b) 

(c) 

(d) 


0.018; 

0.22 + 0.002 + 0.160 + 0.102 + 0.046 + 0.084 
0.102/0.614 = 0.166; 

0.102+0.046 n A 7Q 

0.175+0.134 — U.4/y. 


0.614; 


2.84 Consider the events: 

C: an oil change is needed, 
F: an oil filter is needed. 


(a) P(F | C) = = 0.56. 

(b) P(C I F) = = 0,35. 

2.85 Consider the events: 

H: husband watches a certain show, 
W: wife watches the same show. 


(a) P(W n H) = P(W)P(H | W) = (0.5)(0.7) = 0.35. 

_ P(WDH) _ 0.35 _ 


= +/ = 0.875. 


(b) P(W | H) = 

(c) P(W U H) = P(W) + P(H) - P(W n H) = 0.5 + 0.4 - 0.35 = 0.55. 


2.86 Consider the events: 


H: the husband will vote on the bond referendum, 
W: the wife will vote on the bond referendum. 


Then P(H ) = 0.21, P(W) = 0.28, and P(H D W) = 


0.15. 


(a) P(H UW) = P(H) + P(W ) - P(H n W) = 0.21 + 0.28 - 0.15 


(b) P(W I H) = 


0.15 

0.21 


5 

T 


(c) P{H | W') 


P(Hnw') 

P{W) 


0.06 

0.72 


12 ’ 


0.34. 
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2.87 Consider the events: 

A: the vehicle is a camper, 

B: the vehicle has Canadian license plates. 


(a) P(B | A) = mop 


0.09 _ _ 9 _ 
0.28 28 ‘ 


(b) p(a i b) = 


0.09 _ 3 
0.12 4 ' 


(c) P(B' U A') = 1 - P(A n B) = 1 - 0.09 


0.91. 


2.88 Define 

H: head of household is home, 

C: a change is made in long distance carriers. 
P(HDC) = P(H)P(C | H) = (0.4)(0.3) = 0.12. 


2.89 Consider the events: 

A: the doctor makes a correct diagnosis, 

B: the patient sues. 

P(A' n B) = P(A')P(B I A') = (0.3)(0.9) = 0.27. 

2.90 (a) 0.43; 

(b) (0.53)(0.22) = 0.12; 

(c) 1 - (0.47)(0.22) = 0.90. 


2.91 Consider the events: 

A: the house is open, 

B\ the correct key is selected. 

P(A) = 0.4, P(A') = 0.6, and P(B) = ^Sp = | = 0.375. 

So, P[A U ( A ' n B )] = P(A) + P(A')P(B) = 0.4 + (0.6)(0.375) = 0.625. 

2.92 Consider the events: 

F: failed the test, 

P: passed the test. 


(a) 

P (failed at 

- least one tests) = 1 — P(P 1 P 2 P 3 P 4 ) 

= 1 - (0.99) (0.97) (0.98) (0.99) 


1-0.93 = 

0.07, 


(b) 

P (failed 2 

or 3) = P(P 4 )P(P 4 )( 1 - P(P 2 P 3 )) = 

(0.99)(0.99)(1 - (0.97)(0.98)) 


0.0484. 



(c) 

100 x 0.07 

= 7. 


(d) 

0.25. 




2.93 Let A and B represent the availability of each fire engine. 

(a) P(A ' n B ') = P(A')P(B') = (0.04) (0.04) = 0.0016. 

(b) P(A UB) = 1- P(A ' nB') = l- 0.0016 = 0.9984. 
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2.94 P{T' n N') = P{T')P{N') = (1 - P(T))( 1 - P(N )) = (0.3)(0.1) = 0.03. 

2.95 Consider the events: 

Ap. aspirin tablets are selected from the overnight case, 

A 2 : aspirin tablets are selected from the tote bag, 

L 2 : laxative tablets are selected from the tote bag, 

T\ : thyroid tablets are selected from the overnight case, 

T 2 : thyroid tablets are selected from the tote bag. 


(a) P(Ti n T 2 ) = P(T t )P(T 2 ) = (3/5)(2/6) = 1/5. 

(b) P(T[ n T' 2 ) = P(T[)P(%) = (2/5) (4/6) = 4/15. 

(c) l-P(A 1 nA 2 )-P(T l nT 2 ) = l-P(A 1 )P(A 2 )-P(T l )P(T 2 ) = 1 - (2/5)(3/6) 
(3/5)(2/6) = 3/5. 


2.96 Consider the events: 

X: a person has an A"-ray, 

C: a cavity is filled, 

T : a tooth is extracted. 

p(x n c n t) = p(x)p(c \ x)p(t \xn c) = (o.6)(o.3)(o.i) = o.ois. 

2.97 (a) P(Q 1 nQ 2 nQ 3 nQ A ) = P(Q 1 )P(Q 2 \Q 1 )P(Q 3 \Q 1 nQ 2 )P(Q 4 \Q 1 nQ 2 nQ 3 ) 

(15/20) (14/19) (13/18) (12/17) = 91/323. 

(b) Let A be the event that 4 good quarts of milk are selected. Then 


P(A) 



91 

323' 


2.98 P = (0.95) [1 - (1 - 0.7)(1 - 0.8)](0.9) = 0.8037. 

2.99 This is a parallel system of two series subsystems. 

(a) P = 1 - [1 - (0.7) (0.7)] [1 - (0.8)(0.8)(0.8)] = 0.75112. 

( b) p = F y , / Cnpn f ) = (as)( n 8 7 ) y,? (a8) = 0.2045. 

v / aPsystem works 0.75112 

2.100 Define S: the system works. 

p( Ai | oi\ _ PA'nS') _ P(A')(l-P(CnDDE)) _ (0.3)[1-(0.8)(0.8)(0.8)1 _ n coo 

r 1 I ° / P(S') l-P(S) 1-0.75112 o.ooo. 

2.101 Consider the events: 

C: an adult selected has cancer, 

D\ the adult is diagnosed as having cancer. 

P(C) = 0.05, P(D | C) = 0.78, P(C) = 0.95 and P(D \ C) = 0.06. So, P(D) 
P(C n D) + P(C CD) — (0.05)(0.78) + (0.95)(0.06) = 0.096. 
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2.102 Let Si, S 2 , S 3 , and S 4 represent the events that a person is speeding as he passes through 
the respective locations and let R represent the event that the radar traps is operating 
resulting in a speeding ticket. Then the probability that he receives a speeding ticket: 

P(R) = EP(R | Si)P(Si) = (0.4)(0.2) + (0.3)(0.1) + (0.2)(0.5) + (0.3)(0.2) = 0.27. 


2.103 P(C I D) = = S = 0-40625. 

2.104 P(S 2 | R) = P( g^f 2) = jgf = 1/9. 

2.105 Consider the events: 

A: no expiration date, 

By. John is the inspector, P(Bi) = 0.20 and P(A \ B\ 
By. Tom is the inspector, P(B 2 ) = 0.60 and P(A \ B 2 
By. Jeff is the inspector, P(B 3 ) = 0.15 and P(A j B 3 ) 
By. Pat is the inspector, P(B 4 ) = 0.05 and P(A \ B 4 ) 

p/p | A\ = (0.005)(0.20) 

V 1 I > (0.005) (0.20)+(0.010)(0.60)+(0.011)(0.15)+(0. 005) (0.05) 

2.106 Consider the events 

E: a malfunction by other human errors, 

A: station A, B: station B , and C : station C. 

p/ r I p) - p(f | C)P(C) 

r \ > P(E \)1\A)~P{E B)P{B)+P(E C)P(C) (" 7/1 

= 0.2632. 


| Bx) = 0.005, 
B 2 ) = 0.010, 
B 3 ) = 0.011, 
B 4 ) = 0.005, 
TTT^T = 0.1124. 


(5/10) (10/43) 

(7/18)(18/43)+(7/15)(15/43)+(5/10)(10/43) 


2.107 (a) P(A nBnC) = P(C \ A n B)P{B \ A)P(A) = (0.20)(0.75)(0.3) = 0.045. 

(b) p(B' n c) = P(A nB'nC) + P{A' nB'nc)= p{c \ A n b')p(B' \ A)P(A) + 
P(C I A'nB')P{B' I A')P(A') = (0.80) (1 — 0.75) (0.3) + (0.90) (1 — 0.20) (1 — 0.3) = 
0.564. 


(c) Use similar argument as in (a) and (b), P(C) = P(A fl B D C) + P(A D B' PC) + 
P{A' nBnC) + P(A' nB'nC) = 0.045 + 0.060 + 0.021 + 0.504 = 0.630. 

(d) P(A \ BTC) = P(A nB'n C)/P(B ' n C) = (0.06)(0.564) = 0.1064. 


2.108 Consider the events: 

A: a customer purchases latex paint, 

A': a customer purchases semigloss paint, 

B: a customer purchases rollers. 

p/ a | m PUj I dlTidl - (0-60)(0-"5) 

r \ u ) | a)P(A)+P(B | A')P(A') (0.60)(0.75)+(0.25)(0.30) 


0.857. 


2.109 Consider the events: 

G: guilty of committing a crime, 

/: innocent of the crime, 
v. judged innocent of the crime, 
g: judged guilty of the crime. 

p/T | n \ = P(9 I I)P(I) = (0.01)(0.95) 

1 y 1 \ y ) P(g | G)P(G)+P(g I I)P(I) (0.05) (0.90)+ (0.01) (0.95) 


0.1743. 
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2.110 Let Ai be the event that the ith patient is allergic to some type of week. 

(a) p(4 4 n A 2 n A% n A 4) -1 - p(4 3 n A 2 n A 3 n A 4) -1 - p(4 4 n A 2 n A 3 n A 4) + 
p(a[ n 4 2 n 4 3 n 4 4 ) = p(a 1 )p(a 2 )p(a 3 )p(a' 4 ) + p(a 1 )p(A 2 )p(A' 3 )p(A 4 ) + 
P(A 1 )P(A' 2 )P(A3)P(A 4 ) + P(4;)P(4 2 )P(4 3 )P(4 4 ) = (4) (1/2) 4 = 1/4. 

(b) P(A[ n 4; n 4; n a' 4 ) = p^p^p^p^) = (1/2) 4 = l/ie. 

2.111 No solution necessary. 


2.112 (a) 0.28 + 0.10 + 0.17 = 0.55. 

(b) 1-0.17 = 0.83. 

(c) 0.10 + 0.17 = 0.27. 


2113 P= mmmm. 


01 ) 


2.114 (a) P(M\ (~l M 2 (~l M 3 n M 4 ) = (O.l) 4 = 0.0001, where M* represents that ith person 
make a mistake. 

(b) P(J n c n R’ n W) = (o.i)(o.i)(o.9)(o.9) = o.oosi. 


2.115 Let R, S, and L represent the events that a client is assigned a room at the Ramada 
Inn, Sheraton, and Lakeview Motor Lodge, respectively, and let F represents the event 
that the plumbing is faulty. 


(a) P(F) = P(F | R)P(R) + P(F 
(0.04) (0.4) + (0.08)(0.3) = 0.054. 

(b) P(L | F) = ™ = l 


S)P(S) + P(F I L)P(L) = (0.05)(0.2) + 


2.116 (a) There are ('/) = 84 possible committees. 

(b) There are ( 1 ) (®) = 40 possible committees. 

(c) There are ( 4 ) (]) (®) = 15 possible committees. 

2.117 Denote by R the event that a patient survives. Then P(R) = 0.8. 


(a) p(r i n r 2 n r' 3 ) + p(r i n r ' 2 n R 3 )p{R ' 1 nR 2 n r 3 ) = P(i?i)P(i? 2 )P(R3) + 
P(R 1 )P(R' 2 )P(R 3 ) + P(P;)P(P 2 )P(P 3 ) = (3)(0.8)(0.8)(0.2) = 0.384. 

(b) P{R 1 n R 2 n P 3 ) = P(Pi)P(P 2 )P(P 3 ) = (0.8) 3 = 0.512. 


2.118 Consider events 

M : an inmate is a male, 

N: an inmate is under 25 years of age. 

P(M' n N') = P(M') + P(N') - P(M' U N') = 2/5 + 1/3 - 5/8 = 13/120. 

2.119 There are (3) (3) (®) = 800 possible selections. 
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2.120 Consider the events: 

B t : a black ball is drawn on the ith drawl. 

Gf a green ball is drawn on the ith drawl. 

(a) P(B 1 nB 2 n B 3 ) + P(G 1 n G 2 n G 3 ) = (6/10)(6/10)(6/10) + (4/10)(4/10)(4/10) = 
7/25. 

(b) The probability that each color is represented is 1 — 7/25 = 18/25. 

2.121 The total number of ways to receive 2 or 3 defective sets among 5 that are purchased 

b® O + ©0 =288. 

2.122 A Venn diagram is shown next. 



(a) (An B)': 1, 2, 3, 6, 7, 8. 

(b) (AUB)': 1, 6. 

(c) (AnC)UB: 3, 4, 5, 7, 8. 

2.123 Consider the events: 

O: overrun, 

A: consulting firm A, 

B : consulting firm B, 

C: consulting firm C. 

( a \ p(r I n\ - P(Q | C)P(C) _ (0.15)(0.25) 

w P(0 I 1 )P{A)+P(0 B)P{B)+P(0 C)P(C) (0.05)(0.40)+(0.03)(0.35)4-(0.1S) (0.25) 

0.0375 n cci c 

0.0680 U-OOI0. 

(b) P(A | O) = = 0.2941. 

2.124 (a) 36; 

(b) 12; 

(c) order is not important. 

2.125 (a) vAy = 0.0016; 

< b > = 1 = 0-«71. 
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2.126 Consider the events: 

C\ a woman over 60 has the cancer, 

P : the test gives a positive result. 

So, P(C ) = 0.07, P{P' | C) = 0.1 and P(P j C') = 0.05. 

P(C I P'\ — P(P' | C)P(C) _ (0.1)(0.07) 0.007 n nn+cc 

+ I / P(P' I C)P{C)+P{P' 0')P(C') (0.1)(0.07)+(1— 0.05)(1— 0.07) 0.8905 u.uuiou. 

2.127 Consider the events: 

A: two nondefective components are selected, 

N: a lot does not contain defective components, P(N ) = 0.6, P(A \ N ) = 1, 

O: a lot contains one defective component, P(0) = 0.3, P(A \ O ) = 


T: a lot contains two defective components, P(T) = 0.1, P(A \ T) = 


(?) 

( 2 2 °) 

(?) 

(?) 


153 

190’ 


(a) 

P(N\ 

A) 


_ 0.6 



0.9505 

(b) 

P(0 

A) 

(c) 

P(T 

A): 


P(A | N)P(N) 


( 1 )( 0 . 6 ) 


(l)(0.6)+(9/10)(0.3)+(153/190)(0.1) 


0.9505 

-0.6312 


0.2841 = 0.0847. 


2.128 Consider events: 

D: a person has the rare disease, P(D ) = 1/500, 

P: the test shows a positive result, P(P | D) = 0.95 and P(P \ D ') = 0.01. 
p/ D | p\ = P(P I D)P(D) _ (0.95) (1/500) 


P(P I D)P(D)+P(P I D')P(D') (0.95) (1/500)+ (0.01) (1—1/500) 


= 0.1599. 


2.129 Consider the events: 

1: engineer 1, P( 1) = 0.7, and 2: engineer 2, P( 2) = 0.3, 

P: an error has occurred in estimating cost, P(E j 1) = 0.02 and P(E j 2) = 0.04. 

pn | = PIP | l)P(l) _ (0.02)(0.7) 


p(e | i)P(i)+P(P | 2)P(2) (o.o2)(o.7)+(o.o4)(o.3) 0.5385, and 

P( 2 j P) = 1 — 0.5385 = 0.4615. So, more likely engineer 1 did the job. 


2.130 Consider the events: D : an item is defective 


(a) P(P!P 2 P 3 ) = P(Pi)P(P 2 )P(P 3 ) = (0.2) 3 = 0.008. 

(b) P(three out of four are defectives) = (g)(0.2) 3 (l — 0.2) = 0.0256. 

2.131 Let A be the event that an injured worker is admitted to the hospital and N be the event 
that an injured worker is back to work the next day. P(A) = 0.10, P(iV) = 0.15 and 
P(ADN) = 0.02. So, P(dUiV) = P(A) + P(N)-P(ADN) = 0.1 + 0.15-0.02 = 0.23. 

2.132 Consider the events: 

T : an operator is trained, P(T) = 0.5, 

M an operator meets quota, P(M | T) = 0.9 and P(M | T') = 0.65. 

P(T I A/t\ P(M | T)P(T) (0.9)(0.5) a ror|7 

\ 1V1 ) P{M \ T)P(T)+P(M | T')P(T') (0.9) (0.5)+ (0.65) (0.5) O.OOU/ . 
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2.133 Consider the events: 

A: purchased from vendor A, 

D: a customer is dissatisfied. 

Then P(A) = 0.2, P(A j D ) = 0.5, and P(D) = 0.1. 

So, P(D | A) = P(A j °2f {D) = (a5 0 K 2 ai) = 0.25. 

2.134 (a) P (Union member | New company (same field)) = yyyio = = 0.5652. 

(b) P(Unemployed j Union member) = = ^ = 0.034. 

2.135 Consider the events: 

C: the queen is a carrier, P(C) = 0.5, 

D : a prince has the disease, P(D \ C) = 0.5. 

p(r I ri n' n'\ — /^uybiy c)r(o (o.5) 3 (o.5) _ i 

' I 1 2 3l p(d[d' 2 d' 3 I C)P{C)+P(d' 1 d' 2 d' 3 I C')P(C’) (0.5) 3 (0.5)+l(0.5) 9' 

2.136 Using the solutions to Exercise 2.50, we know that there are total 365 P 60 ways that no 
two students have the same birth date. Since the total number of ways of the birth 
dates that 60 students can have is 365 60 , the probability that at least two students 
will have the same birth date in a class of 60 is P = 1 — 3 |gffo° • To compute this 
number, regular calculator may not be able to handle it. Using approximation (such 
as Stirling’s approximation formula), we obtain P = 0.9941, which is quite high. 



Chapter 3 

Random Variables and Probability 
Distributions 


3.1 Discrete; continuous; continuous; discrete; discrete; continuous. 

3.2 A table of sample space and assigned values of the random variable is shown next. 


Sample Space 

X 

NNN 

0 

NNB 

1 

NBN 

1 

BNN 

1 

NBB 

2 

BNB 

2 

BBN 

2 

BBB 

3 


3.3 A table of sample space and assigned values of the random variable is shown next. 


Sample Space 

w 

HHH 

3 

HHT 

1 

HTH 

1 

THH 

1 

HTT 

-1 

THT 

-1 

TTH 

-1 

TTT 

-3 


3.4 S = {HHH, THHH, HTHHH, TTHHH , TTTHHH , HTTHHH, THTHHH , 

H HTHH H, . . . }; The sample space is discrete containing as many elements as there 
are positive integers. 
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3 

3.5 (a) c = 1/30 since 1 = c i x2 + 4) = 30c. 

x=0 

(b) c = 1/10 since 



10c. 


3.6 


(a) 


P{X > 200) = 


r°° 20000 

J 200 (a+100) 3 


dx 


10000 

(a+100)- 


oo 

200 


1 

9 ' 


(b) 


P(80 < X < 200) = 


r!20 20000 

J 80 ( lc + 100) 3 


dx 


10000 

(a+100) 2 


120 

80 


1000 

9801 


0 . 1020 . 


3.7 (a) P(X < 1.2) = /g 1 x dx + / 1 L2 ( 2 - x) dx = f * + fex - f ) 


1 1.2 


= 0 . 68 . 


(b) P(0.5 < X < 1) = f 0 l 5 x dx = ^ 


= 0.375. 


0.5 


3.8 Referring to the sample space in Exercise 3.3 and making use of the fact that P(H ) = 
2/3 and P(T) = 1/3, we have 
P(W = -3) = P(TTT) = (1/3) 3 = 1/27; 

P(W = -1) = P(HTT) + P(THT ) + P(TTH ) = 3(2/3)(l/3) 2 = 2/9; 

P(W = 1) = P(HHT) + P(HTH) + P(THH ) = 3(2/3) 2 (l/3) = 2/9; 

P(W = 3) = P(HHH) = (2/3) 3 = 8/27; 

The probability distribution for W is then 


w 

-3-11 3 

P(W = w) 

1/27 2/9 2/9 8/27 


3.9 (a) P(0 < X < 1) = Jo dx = 
(b) P( 1/4 < A' < 1/2) = J, 1 /; 


= 1 . 


_ f l/2 2(01+2) d = (x+2) 2 
5 5 


1/2 

1/4 


19/80. 


3.10 The die can land in 6 different ways each with probability 1/6. Therefore, f(x) = 
for x = 1, 2, . . . , 6. 

3.11 We can select x defective sets from 2, and 3 — x good sets from 5 in ( 2 ) ( 3 / r ) ways. A 
random selection of 3 from 7 sets can be made in (0 ways. Therefore, 

( 2 )( 5 ) 

/(*) = - 7 #=< * = 0 , 1 , 2 . 

\ 3 / 


X 

0 1 

2 

f( x ) 

2/7 4/7 

1/7 


In tabular form 


dll - 1 
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The following is a probability histogram: 

4/7 

3/7 

2/7 

1/7 

1 2 3 

x 


3.12 (a) P(T = 5) = F( 5) - F( 4) = 3/4 - 1/2 = 1/4. 

(b) P(T > 3) = 1 - F(3) = 1 - 1/2 = 1/2. 

(c) P(1.4 < T < 6) = F( 6) - F(1.4) = 3/4 - 1/4 = 1/2. 

3.13 The c.d.f. of X is 

0, for x < 0, 

0.41, for 0 < x < 1, 

0.78, for 1 < x < 2, 

0.94, for 2 < x < 3, 

0. 99, for 3 < x < 4, 

1, for x > 4. 

3.14 (a) P(X < 0.2) = F(0.2) = 1 - e" 1 ' 6 = 0.7981; 

(b) f(x) = F'(x) = 8e~ Sx . Therefore, P( X < 0.2) = 8 / 0 °’ 2 e~ 8x dx = — 8a: |°' 2 
0.7981. 

3.15 The c.d.f. of X is 

0, for x < 0, 

2/7, for 0 < x < 1, 

6/7, for 1 < x < 2, 

1, for x >2. 

(a) P(X = 1) = P(X < 1) - P(X < 0) = 6/7 - 2/7 = 4/7; 

(b) P(0 <X<2) = P{X < 2) - P(X < 0) = 1 - 2/7 = 5/7. 
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3.16 A graph of the c.d.f. is shown next. 


1 


6/7 


5/7 


F(x) 


3/7 


2/7 < 


1/7 



o 1 2 


X 


3.17 


3.18 


(a) Area = J ] 3 (l/2) dx = ||^ = 1. 

(b) P { 2 < X < 2.5) / 2 2 ' 5 (l/2) dx= f | 2 ' 

(c) P(X< 1.6) = / 1 16 (l/2) f|;- 6 


(a) P(X < 4) = f 2 4 dx : 

(b) P( 3 <X<4) = / 3 4 ^±£) 


(i+z ) 2 4 _ 
27 2 

dx = 


i 

4 " 

0.3. 

16/27. 



3.19 F(x) = /*( 1/2) dt = 

P(2 < X < 2.5) = P(2.5) - P(2) = M _ 1 = 1. 


3.20 P(x) = A J x ( 1 + t) dt = (t + yj 

p(3 < x < 4) = f(4) - f(3) = m _ mm = i 


(x+4)(x— 2) 
— 27 ! 


3.21 (a) 1 = k J‘1 ^/x dx = ^x 3 / 2 ^ = ijb Therefore, k — §. 

(b) P(x) = f /p \/t dt — f 3 / 2 = a; 3 / 2 . 

P(0.3 < X < 0.6) = F(0.6) - P(0.3) = (0.6) 3 / 2 - (0.3) 3 / 2 = 0.3004. 


3.22 Denote by A" the number of spades int he three draws. Let S and N stand for a spade 
and not a spade, respectively. Then 

P(X = 0) = P(NNN) = (39/52) (38/51) (37/50) = 703/1700, 

P(X = 1) = P(SNN) + P(NSN) + P(NNS) = 3(13/52) (39/51) (38/50) = 741/1700, 
P(X = 3) = P(SSS) = (13/52) (12/51) (11/50) = 11/850, and 
P(X = 2) = 1 - 703/1700 - 741/1700 - 11/850 = 117/850. 

The probability mass function for X is then 


X 

0 12 3 

f(x) 

703/1700 741/1700 117/850 11/850 
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3.23 The c.d.f. of X is 


0 , 

for w < —3, 

1/27, 

for —3 < w < — 

7/27, 

for — 1 < w < 1 , 

19/27, 

for 1 < w < 3, 

1 , 

for w > 3, 


(a) P(W >0 = 1 - P(W < 0 ) = 1 - 7/27 = 20/27. 

(b) P(- 1 <W<3) = F( 2) - F(— 3) = 19/27 - 1/27 = 2/3. 


3.24 There are 
and 4 — x 


( 4 °) ways of selecting any 4 CDs from 10. We can select x jazz CDs from 5 
from the remaining CDs in (/) ( 4 /_J ways. Hence 


f(x) = 


CHA) 

(?) 


x — 0,1, 2, 3, 4. 


3.25 Let T be the total value of the three coins. Let D and N stand for a dime and nickel, 
respectively. Since we are selecting without replacement, the sample space containing 
elements for which t = 20, 25, and 30 cents corresponding to the selecting of 2 nickels 

PH 4 ) 

and 1 dime, 1 nickel and 2 dimes, and 3 dimes. Therefore, P(T = 20) = = > 

P(T = 25) = S^- = §, 

PIT = 30) = SI = i, 

and the probability distribution in tabular form is 


t 

20 

25 

30 

P(T = t) 

1/5 

3/5 

1/5 


As a probability histogram 
3/5 I 1 


2/5 

5? 

1/5 


20 


25 

X 


30 



34 


Chapter 3 Random Variables and Probability Distributions 


3.26 Denote by X the number of green balls in the three draws. Let G and B stand for the 
colors of green and black, respectively. 


Simple Event 

X 

P(X = x ) 

BBB 

0 

(2/3) 3 = 8/27 

GBB 

1 

( 1/3) (2/3) 2 = 4/27 

BGB 

1 

( 1/3) (2/3) 2 = 4/27 

BBG 

1 

( 1/3) (2/3) 2 = 4/27 

BGG 

2 

(l/3) 2 (2/3) = 2/27 

GBG 

2 

(l/3) 2 (2/3) = 2/27 

GGB 

2 

(l/3) 2 (2/3) = 2/27 

GGG 

3 

(1/3) 3 = 1/27 


The probability mass function for X is then 


X 

0 12 3 

P(X = x) 

8/27 4/9 2/9 1/27 


3.27 (a) For x > 0, F{x) = exp(— f/2000) dt = — exp(— £/2000)|g 

= 1 — exp(— x/2000). So 


F(x) 


0, x < 0, 

1 — exp(— rr/2000), x > 0. 


(b) P(X > 1000) = 1 - F(1000) = 1 - [1 - exp(— 1000/2000)] = 0.6065. 

(c) P(X < 2000) = F(2000) = 1 - exp(-2000/2000) = 0.6321. 


3.28 

(a) 

f(x. 

) > 0 and f 2 f 7 f f dx = 

2,1 26.25 _ 
5 1 23.75 

_ 2.5 . 
2.5 

= 1. 


(b) 

P( X < 24) = ;“ 75 1 dx = 

§(24- 23.75) 

= 0.1. 


(c) 

P(X > 26) = 25 1 dx = 

: |(26.25- 

-26) 

= 0.1. It is not extremely rare. 

3.29 

(a) 

f(x 

) > 0 and J] 00 3:/' 4 dx ■ 

Q X ~ 3 

_ 6 3 

00 

1 

1. So, this is a density function. 


(b) 

For 

x > 1, F(x) — // 3 t~ A 

dt = 1 — 

a: -3 . 

So, 





fo 


x < 1, 




F{x) = ^ i 

— x~ 

" 3 , x > 1. 


(c) 

P(X > 4) = 1 - F(4) = V 

- 3 = 0.0156. 


3.30 

(a) 

1 = 

k f * 1 ( 3 — x 2 ) dx = k ( 

3x - t) 

i 

-l 

fk. So, k=±. 
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(b) For -1 < x < 1, F(x) = ^ ^(3 - t 2 ) = (3t - 1 t 3 ) = \ + - f^. 

So,P(*<i) = ±-(£) (ij-i(i) 3 =^- 

(c) P(|X| < 0.8) = P(X < -0.8) + P(X > 0.8) = P(— 0.8) + 1 - P(0.8) 

= 1 + (I _ A 0 .8 + ^0.8 3 ) - Q + ^0.8 - ^0.8 3 ) = 0.164. 

3.31 (a) For y > 0, F(y) = \ f’ J e "*/ 4 dy = 1 - e^ 4 . So, P(y > 6) = e“ 6 / 4 = 0.2231. This 

probability certainly cannot be considered as “unlikely.” 

(b) P(Y < 1) = 1 — e -1 / 4 = 0.2212, which is not so small either. 

3.32 (a) f(y) > 0 and f ( ] 5(1 — y) 4 dy = — (1 — y) b \\ = 1. So, this is a density function. 

(b) P(Y < 0.1) = - (1 - yK 1 = 1 - (1 - 0.1) 5 = 0.4095. 

(c) P(Y > 0.5) = (1 - 0.5) 5 = 0.03125. 

3.33 (a) Using integral by parts and setting 1 = k y 4 (l — y) 3 dy, we obtain k = 280. 

(b) For 0 < y < 1, F(y) = 56y 5 (l — Y) 3 + 28y 6 (l — y) 2 + 8y 7 (l — y) + y 8 . So, 
P(Y < 0.5) = 0.3633. 

(c) Using the cdf in (b), P(Y > 0.8) = 0.0563. 

3.34 (a) The event Y = y means that among 5 selected, exactly y tubes meet the spec- 

ification (M) and 5 — y (M') does not. The probability for one combination of 
such a situation is (0.99) y (l — 0.99) 5 ~ y if we assume independence among the 
tubes. Since there are y \^L y y permutations of getting y Ms and 5 — y M's , the 
probability of this event (Y = y) would be what it is specified in the problem. 

(b) Three out of 5 is outside of specification means that Y = 2. P(Y = 2) = 9.8 x 10~ 6 
which is extremely small. So, the conjecture is false. 

g 

3.35 (a) P(X > 8) = 1 - P(X < 8) = E e“ 6 f- = e" 6 (§ + ^ + • • • + f ) = 0.1528. 

(b) P(X = 2) = e“ 6 f = 0.0446. 

3.36 For 0 < x < 1, F(x) = 2 f*( 1 — t) dt — — (1 — t) 2 / = 1 — (1 — x ) 2 . 


(a) P(X < 1/3) = 1 - (1 - 1/3) 2 = 5/9. 

(b) P(X > 0.5) = (1 - 1/2) 2 = 1/4. 

(c) P(X < 0.75 | X > 0.5) = P( ° P (gg) 75) 


( 1 — 0 . 5) 2 — ( 1 — 0 . 75) 2 _ 3 
( 1 - 0 . 5) 2 — 4 ' 


3 3 3 3 

3.37 (a) E E f{ x iU) = c E E X V = 36c = 1. Hence c = 1/36. 

x=0 y = 0 x=0 y = 0 

(b) E E f( x , v) = c E E \ x -*y\ = 15c = 1 - Hence c = V 15 - 

x y x y 


3.38 The joint probability distribution of (. X , Y) is 
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f(x,y) 

X 

0 12 3 

0 

0 1/30 2/30 3/30 

y 1 

1/30 2/30 3/30 4/30 

2 

2/30 3/30 4/30 5/30 


(a) P(X < 2, Y = 1) = /( 0, 1) + /( 1, 1) + / (2, 1) = 1/30 + 2/30 + 3/30 = 1/5. 

(b) P(X > 2, 1" < 1) = /( 3, 0) + /(3, 1) = 3/30 + 4/30 = 7/30. 

(c) P(X > Y) = /( 1, 0) + / (2, 0) + / (3, 0) + / (2, 1) + / (3, 1) + /(3, 2) 

= 1/30 + 2/30 + 3/30 + 3/30 + 4/30 + 5/30 = 3/5. 

(d) P(X + Y = 4) = /( 2, 2) + /(3, 1) = 4/30 + 4/30 = 4/15. 

3.39 (a) We can select x oranges from 3, y apples from 2, and 4 — x — y bananas from 3 
in Q) Q) (a-x - y) ways- A random selection of 4 pieces of fruit can be made in ( 3 ) 
ways. Therefore, 


/SW2W 3 \ 

\xJ \y / \4—x—yd 


x = 0, 1,2,3; y = 0,1,2; 1 < x + y < 4. 


(b) p[(x, y) e A] = P(x + y < 2) = /( 1, 0) + /(2, 0) + /(o, 1) + /( 1, 1) + /( 0, 2) 

= 3/70 + 9/70 + 2/70 + 18/70 + 3/70 = 1/2. 

3.40 (a) g{x) = | f*(x + 2y) dy = |(x + 1), for 0 < x < 1. 

(b) /i(y) = | f 0 \x + 2y) dy = |(1 + Ay), for 0 < y < 1. 

(c) P(X < 1/2) = | J^ 2 (x + 1) dx = dd. 

3-41 (a) /’(-V + V < 1/2) = / 0 1/2 / 0 1/2_ “ 24xs dx dy = 12 / 0 1/2 (1 - j/) 2 // dy = 

(b) y(x) = J 0 ' x 24xy dy = 12x(l — x) 2 , for 0 < x < 1. 

( c ) /(dk) = 12^)2 = (t |)2 , for 0 < y < 1 - x. 

Therefore, P(y < 1/8 | X = 3/4) = 32 f Q 1/8 y dy = 1/4. 

3.42 Since /i(y) = e~ y e~ x dx = e _y , for y > 0, then /(x|y) = f(x,y)/h(y) = e~ x , for 
x > 0. So, P(0 < X < 1 | y = 2) = /p e“ x dx = 0.6321. 

3.43 (a) P(0 < X < 1/2, 1/4 < Y < 1/2) = f Q 1/2 J] 1 / 2 4 xy dy dx = 3/8 f Q 1/2 x dx = 3/64. 

(b) P(X < y) = /q 1 Jjf 4xy dx dy = 2 y 3 dy = 1/2. 

3.44 (a) 1 = k / 3 5 q ° J^q(x 2 + y 2 ) dx dy = k{ 50 — 30) ^ J 3 5 0 °x 2 dx + / 3 5 0 °y 2 dyj = . 10 4 . 

So, k = 3I2 • 10 -4 . 
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(b) P( 30 < X < 40, 40 < Y < 50) = ^ • 10~ 4 f™(x 2 + y 2 ) dy dx 

= afe ■ dx + /“ 9 2 dy) = i- 2 - 1(T= 

(c) P( 30 < X < 40, 30 < Y < 40) = J2 • 10~ 4 f^(x 2 + y 2 ) dx dy 

= 2J, ■ 10- 4 (40 - 30) / 4 V dx = &. lO- 4 ^ 


49 

196 ' 


3.45 P(X + Y > 1/2) = 1 - P{X + Y< 1/2) = 1 - f Q 1/4 f 4/2 x ± dy dx 

= 1 — f 0 1/fl [in (| — x) — In a:] da: = 1 + [Q — x) In (| — x) — x lnx] \ 4 J 4 
= 1 + ilnQ) = 0.6534. 


3.46 (a) From the column totals of Exercise 3.38, we have 

x 0 1 2 3 

g(x) 1/10 1/5 3/10 2/5 

(b) From the row totals of Exercise 3.38, we have 

y I 0 1 2 

h{y) 1/5 1/3 7/15 


3.47 (a) g(x) =2 f 1 dy = 2(1 — x) for 0 < x < 1; 
h(y) = 2 $ dx = 2 y, for 0 < y < 1. 

Since f(x,y) 7^ g(x)h(y), X and Y are not independent. 

(b) }{x\y) = f(x,y)/h(y) = 1/y, for 0 < x < y. 

Therefore, P(l/4 < X < 1/2 | 1" = 3/4) = f J 4 ) 2 dx=\. 


3.48 (a) g{ 2) = £ /( 2, y) = /( 2, 0) + /( 2, 1) + /( 2, 2) = 9/70 + 18/70 + 3/70 = 3/7. So, 
y = 0 

}{y\2) = /(2, y)/g(2) = (7/3)/(2, y). 

/( 0|2) = (7/3)/ (2, 0) = (7/3) (9/70) = 3/10, /( 1|2) = 3/5 and /( 2|2) = 1/10. In 
tabular form, 

y I 0 1 2 

JW) 3/10 3/5 1/10 
(b) P(y = 0 I X = 2) = /(0|2) = 3/10. 


3.49 


(a) 

X 

1 

2 

3 

g(x) 

0.10 

0.35 

0.55 

(b) 

y 

1 

2 

3 

%) 

0.20 

0.50 

0.30 

(c) P(Y = 

= 3 | X 

= 2) 

0.2 

0.05+0.10+0.20 


0.5714. 
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x 



2 4 

%) 

1 

0.10 0.15 

0.25 

y 3 

0.20 0.30 

0.50 

5 

0.10 0.15 

0.25 

g(x) 

0.40 0.60 



X 

2 

4 


g(x) 

0.40 

0.60 


y 

1 

3 

5 

h(y) 

0.25 

0.50 

0.25 


3.51 (a) Let X be the number of 4’s and Y be the number of 5’s. The sample space 
consists of 36 elements each with probability 1/36 of the form (■ m,n ) where 
m is the outcome of the first roll of the die and n is the value obtained on 
the second roll. The joint probability distribution f(x,y ) is defined for x = 
0,1,2 and y = 0,1,2 with 0 < x + y < 2. To find /( 0,1), for example, 
consider the event A of obtaining zero 4’s and one 5 in the 2 rolls. Then 
A = {(1,5), (2, 5), (3, 5), (6, 5), (5, 1), (5, 2), (5, 3), (5, 6)}, so /( 0, 1) = 8/36 = 2/9. 
In a like manner we find /( 0,0) = 16/36 = 4/9, /( 0,2) = 1/36, /(1,0) = 2/9, 
/(2, 0) = 1/36, and /(1,1) = 1/18. 

(b) P[(X,Y) E A] = P(2X + Y < 3) = /( 0,0) + /( 0,1) + /( 0,2) + /( 1,0) = 
4/9 + 1/9 + 1/36 + 2/9 = 11/12. 


3.52 A tabular form of the experiment can be established as, 


Sample Space 

X 

y 

HHH 

3 

3 

HHT 

2 

1 

HTH 

2 

1 

THH 

2 

1 

HTT 

1 

-1 

THT 

1 

-1 

TTH 

1 

-1 

TTT 

0 

-3 


So, the joint probability distribution is, 


f(x,y) 

X 

0 12 3 

-3 

1/8 

y -i 

3/8 

l 

3/8 

3 

1/8 
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3.53 (a) If (x,y) represents the selection of x kings and y jacks in 3 draws, we must have 
x — 0, 1, 2, 3; y — 0, 1,2,3; and 0 < x + y < 3. Therefore, (1,2) represents the 
selection of 1 king and 2 jacks which will occur with probability 


/( 1 , 2 ) 



6 

55 


Proceeding in a similar fashion for the other possibilities, we arrive at the follow- 
ing joint probability distribution: 


x 


f(x,y) 

0 12 3 

0 

1/55 6/55 6/55 1/55 

y i 

6/55 16/55 6/55 

2 

6/55 6/55 

3 

1/55 


(b) P[(X, Y ) E A] = P(X + Y > 2) = 1 - P(X + Y < 2) = 1 - 1/55 - 6/55 - 6/55 = 
42/55. 

3.54 (a) P(H) = 0.4, P(T) = 0.6, and S = {HH, HT,TH,TT}. Let (IP, Z) represent 
a typical outcome of the experiment. The particular outcome (1, 0) indicating a 
total of 1 head and no heads on the first toss corresponds to the event TH . There- 
fore, /( 1,0) = P(W = \,Z = 0) = P(TH) = P(T)P(H) = (0.6)(0.4) = 0.24. 
Similar calculations for the outcomes (0, 0), (1, 1), and (2, 1) lead to the following 
joint probability distribution: 


w 


f(w,z) 

0 

1 

2 

z 0 

0.36 

0.24 


1 


0.24 

0.16 


(b) Summing the columns, the marginal distribution of IP is 


w 

0 

1 

2 

g(w) 

0.36 

0.48 

0.16 


(c) Summing the rows, the marginal distribution of Z is 


z 

0 

1 

h(z) 

0.60 

0.40 


(d) P(W > 1) = /( 1, 0) + /( 1, 1) + /( 2, 1) = 0.24 + 0.24 + 0.16 = 0.64. 
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3.55 g(x) = | f 4 ( 6 — x — y) dy — for 0 < x < 2. 

So. f(y\x) = 0$ = yof, for 2 < y < 4, 

and P(1 < Y < 3 | X = 1) = \ f 2 ( 5 - y) dy = |. 

3.56 Since /( 1, 1) ^ g(l)h(l), the variables are not independent. 


3.57 X and Y are independent since f(x,y ) = g(x)h(y) for all (x,y). 

3.58 (a) h(y) = 6 f 4 ~ y x dx = 3(1 - y) 2 , for 0 < y < 1. Since f(x\y) = for 

0 < x < 1 — y, involves the variable y, X and Y are not independent. 

(b) P(X > 0.3 | Y = 0.5) = 8 J 0 ° 3 5 x dx = 0.64. 

3.59 (a) 1 = k f 4 f 4 J Q 2 xy 2 z dx dy dz = 2k f 4 J 0 ' y 2 z dy dz — y f 4 z dz = |. So, k — 3. 

(b) P (X < i, Y > 1 < Z < 2) = 3 f 0 1/4 fy 2 I, 2 xy 2 z dx dy dz = | f ( 4 ^ 4 fy 2 y 2 z dy dz 

= 2i rV4 , 21 _ 

16 JO 512’ 

3.60 <?(x) = 4 /gOn/ dy = 2x , for 0 < x < 1; h(y) = 4 f () l xy dx = 2 y, for 0 < y < 1. Since 
f(x,y ) = g(x)h{y ) for all (x,y), X and Y are independent. 

3.61 #(x) = k f^(x 2 + y 2 ) dy = k [x 2 y + y j = k (20x 2 + 98,9 00 ) , and 
h(y) = k(20y 2 +™f°). 

Since f(x,y ) 7^ g(x)h(y), X and Y are not independent. 

3.62 (a) g(y, z) — | f ( Q 1 x?/z 2 dx = |yz 2 , for 0 < y < 1 and 0 < z < 3. 

(b) %) = § / 0 3 yz 2 dz = 2j/, for 0 < y < 1. 

(c) P Q < X < Y > i, Z < 2) = | J‘, 2 J//3 ./'/^ x?/z 2 dx dy dz = 

(d) Since /(x|y, z) = = 2x, for 0 < x < 1, P (0 < X < | [ Y = |, Z = 2) = 

2 f 4// ' 2 x dx — 

3.63 g(x) = 24 f 4 ~ x xy dy = 12x(l — x) 2 , for 0 < x < 1. 


(a) P( X > 0.5) = 12 Jq 5 x( 1 — x) 2 dx = f 4 5 ( 12x — 24x 2 + 12x 3 ) dx 

(b) h(y) = 24 J 4 ~ y xy dx = I2y(l — y) 2 , for 0 < y < 1. 


(c) f{x\y) = 

So, P (X < | 


~ 12 y(l-y ) 2 ~ (l-l ) 2 , for 0 < X < 1 y. 
r = !)=/„‘ /8 T7fe* = 32 /o 1/8 = 0.25. 


_ 5 _ 

16 


0.3125. 
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3.65 (a) g{x) = f 0 °° ye^ 1 ^ dy = - ^ye^^ |~ + ^ / 0 °° e"* 1 ^ dy 

= 2_ P -v( i+*)|°° 

(l+.r) 2 e lo 

= for x > °- 

h(y) = ye~ y / 0 °° e _ya: dx = — e _y e - 2 /x|^° — e -y , for y > 0. 

(b) P(A > 2, y > 2) = f 2 °° J 0 °° ye _2/ ( 1+a: ) dx dy = — / 2 °° e _2/ dy = J 2 °° e~ 3y dy 

= _ ip-syl 00 = J_ 

3 C 1 2 3e 6 ' 

3.66 (a) P (A" < §, y < |) = | / 0 1/2 Jq^ 2 (x 2 + y 2 ) etedy = § / Q 1/2 (x 2 y + ^ dx 


3.66 (a) P (A < §, y < |) = | f 0 1/2 f 0 1/2 (x 2 + y 2 ) dxdy = § / Q 1/2 (x 2 y + 
= ! f 0 1/2 i x2 + Tl) dx = TC 


(b) P (A > |) = | (x 2 + |) dx = ||. 


X 

0 

1 

2 

3 

4 

5 

6 

f( x ) 

0.1353 

0.2707 

0.2707 

0.1804 

0.0902 

0.0361 

0.0120 


q (o') 

W /(x) 0.1353 0.2707 0.! 

(b) A histogram is shown next. 



1 2 3 4 5 6 7 

x 


X 

0 

1 

2 

3 

4 

5 

6 

F(x) 

0.1353 

0.4060 

0.6767 

0.8571 

0.9473 

0.9834 

0.9954 


3.68 (a) g(x) = | 0 * (x + y) dy — x + |, for 0 < x < 1, and h(y) — y + | for 0 < y < 1. 
(b) P(A > 0.5, y > 0.5) = f Q \ f Q \(x + y) dx dy = J 1 + xy) dy 

= /o^ [(I + 2/) - (I + f)] ^=1- 

3.69 /(x) = Q(0.1) a; (l-0.1) 5 - a: , for x = 0,1, 2, 3, 4, 5. 

3.70 (a) y(x) = f 2 dy = 3xy ~ y ^ 2 = | — |, for 1 < x < 3, and 

%) = fi dx = ^ — |y, for 1 < y < 2. 

(b) No, since g(x)h(y) /(x,y). 

(c) P(X > 2) = f 2 3 (| - 1) = (£-f)[ = f- 
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3.71 (a) /(x) = -£F(x) = x/50 : for x > 0. 

(b) P(X > 70) = 1 - P(X < 70) = 1 - F(70) = 1 - (1 - e” 70 / 50 ) = 0.2466. 

3.72 (a) f(x) — A, for x — 1, 2, . . . , 10. 

(b) A c.d.f. plot is shown next. 

1.0 

0.9 

0.8 

0.7 

0.6 
S' 

ir o.5 

0.4 
0.3 
0.2 
0.1 
0 

1 23456789 10 

X 

3.73 P(X >3) = i / 3 °° e~ y/2 = e~ 3/2 = 0.2231. 

3.74 (a) f(x) > 0 and J 0 10 A dx — 1. This is a continuous uniform distribution. 

(b) P(X < 7) = A fj dx = 0.7. 

3.75 (a) f(y) > 0 and f(y) dy = 10 J 0 '(l - y) 9 dy = - ±§(1 - y) 10 ^ = 1. 

(b) P(Y > 0.6) = ./ 0 ' 6 f(y) dy = -( 1 - y) 10 ^ = (1 - 0.6) 10 = 0.0001. 

3.76 (a) P(Z > 20) = A e” 2 / 10 dz = - e- z/10 |“ = e” 20 / 10 = 0.1353. 

(b) P(Z < 10) = - e- 2 / 10 ^ 0 = 1 - e" 10 / 10 = 0.6321. 

3.77 (a) g(xi) = 2 dx 2 = 2(1 — xi), for 0 < xi < 1. 

(b) /i(x2) = Jq 2 2 dx i = 2^2, for 0 < X2 < 1. 

(c) P(Ab < 0.2, X 2 > 0, 5) = f 0 \ f°' 2 2 dx i dx 2 = 2(1 - 0.5) (0.2 - 0) = 0.2. 

(d) = ^ = ^, for 0 < X! < x 2 . 

3.78 (a) fx 1 (xi) = f 0 xi 6x 2 dx 2 = 3x 2 , for 0 < xi < 1. Apparently, fx i(xi) > 0 and 

fo fx i^i) = fo 3xf dx i = 1. So, fxi (3^1) is a density function. 

(b) /*,,*, feM = = SSf = 2 ff> f” 0 < *2 < 

So, P(X 2 < 0.5 I X, = 0.7) = 5^ dx 2 = §. 
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3.79 (a) g(x) = E ^ = ( i 6 9 ) 4 y i_i / 4 = ri> fora: = 0, 1, 2, similarly, h{y) = f — , 

x=0 

for y = 0, 1, 2, ... . Since f(x, y) = g(x)h(y), X and Y are independent. 

(b) P(X + Y < 4) = /( 0, 0) + /(0, 1) + /( 0, 2) + /(0, 3) + /(l, 0) + /( 1, 1) + /( 1, 2) + 
/ (2, 0) + / (2, 1) + / (3, 0) = ^(l + ^ + ^ + ^ + | + ^ + ^ + ^ + ^ + ^-) = 
i(u4i + ii = ® 

3.80 P(the system works) = P(all components work) = (0.95) (0.99) (0.92) = 0.86526. 

3.81 P(the system does not fail) = P(at least one of the components works) 

= 1-P(all components fail) = 1 - (1 - 0.95) (1 - 0.94) (1 - 0.90) (1 - 0.97) = 0.999991. 

3.82 Denote by X the number of components (out of 5) work. 

Then, P(the system is operational) = P ( X > 3) = P(X = 3) + P ( X = 4) + P ( X = 
5) = ( 3 ) (0.92) 3 (1 - 0.92) 2 + Q(0.92) 4 (l - 0.92) + ( 5 KO. 92) 5 = 0.9955. 




Chapter 4 

Mathematical Expectation 


4.1 E(X) = -4 f° [ Vc f y x dx dy = -At (- 

v > 7ra z j—a j _ / a 2_y2 a Ira- 1 


dy = 0. 


4.2 E(X) = '£x f(x) = (0) (27/64) + (1) (27/64) + (2) (9/64) + (3) (1/64) = 3/4. 

x=0 


4.3 y = E{X) = (20) (1/5) + (25)(3/5) + (30)(l/5) = 25 cents. 


4.4 Assigning wrights of 3 w and w for a head and tail, respectively. We obtain P(H ) = 3/4 
and P(T) = 1/4. The sample space for the experiment is S = {El El, HT,TH,TT}. 
Now if X represents the number of tails that occur in two tosses of the coin, we have 


P(X = 0) = P(HH ) = (3/4) (3/4) = 9/16, 

P(X = 1) = P(HT) + P(TH) = (2) (3/4) (1/4) = 3/8, 
P(X = 2) = P(TT) = (1/4) (1/4) = 1/16. 


The probability distribution for X is then 


a; 0 1 2 

f(x) 9/16 3/8 1/16 


from which we get y = E(X ) = (0) (9/16) + (1) (3/8) + (2) (1/ 16) = 1/2. 

4.5 y = E(X) = (0)(0.41) + (1)(0.37) + (2)(0.16) + (3)(0.05) + (4)(0.01) = 0.88. 

4.6 y = E(X) = ($7) (1/12) + ($9) (1/12) + ($11) (1/4) + ($13) (1/4) + ($15) (1/6) + ($17) (1/6) 
= $12.67. 

4.7 Expected gain = E(X) = (4000) (0.3) + (-1000) (0.7) = $500. 

4.8 Let X = profit. Then 

y = E(X) = (250)(0.22) + (150)(0.36) + (0)(0.28) + (-150)(0.14) = $88. 
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4.9 Let c = amount to play the game and Y = amount won. 


y 

5 — c 3 — c — c 

f(y) 

2/13 2/13 9/13 


E{Y) = (5 — c)(2/13) + (3 — c)(2/13) + (— c)(9/13) = 0. So, 13c = 16 which implies 
c = $1.23. 

4.10 p x = Y,xg{x) = (1)(0.17) + (2)(0.5) + (3)(0.33) = 2.16, 
hr = E yh(y) = (1)(0.23) + (2)(0.5) + (3)(0.27) = 2.04. 

4.11 For the insurance of $200,000 pilot, the distribution of the claim the insurance company 
would have is as follows: 


Claim Amount 

$200,000 

$100,000 

$50,000 0 

fix) 

0.002 

0.01 

0.1 0.888 


So, the expected claim would be 


($200, 000)(0.002) + ($100,000)(0.01) + ($50, 000)(0.1) + ($0)(0.888) = $6,400. 
Hence the insurance company should charge a premium of $6, 400 + $500 = $6, 900. 


4.12 E(X) = f 0 ' 2x(l- x) dx = 1/3. So, (l/3)($5, 000) = $1,667.67. 

4.13 E(X) = ±f 0 1 T ^dx= 

- r 1 dx = 


ix JO 1+x 2 

4.14 E(X) — f 0 5 


15 ' 


4.15 E( X) = Jq x 2 dx + f‘ 2 x(2 — x) dx = 1. Therefore, the average number of hours per 
year is (1)(100) = 100 hours. 


4.16 P(X x + X 2 = 

(T)(T) , 

- ( 1 00 0) + 


1 ) = P{X\ = 1 , x 2 = 0 ) + P(Xi = 

980 \ / 20 \ 

Tiwy^ = (2) (0.0392) = 0.0784. 


o,x 2 


1 ) 


4.17 The probability density function is, 


X 

-3 

6 

9 

fix) 

1/6 

1/2 

1/3 

gix) 

25 

169 

361 


p g{X ) = E[( 2X + l) 2 ] = (25) (1/6) + (169) (1/2) + (361)(l/3) = 209. 

4.18 E(X 2 ) = (0) (27/64) + (l)(27/64) + (4) (9/64) + (9)(l/64) = 9/8. 

4.19 Let Y = 1200X — 50A" 2 be the amount spent. 



Solutions for Exercises in Chapter 4 


47 


X 

0 

l 

2 

3 

f(x) 

1/10 

3/10 

2/5 

1/5 

y = g{x) 

0 

1150 

2200 

3150 


p Y = E( 1200X - 50X 2 ) = (0)(1/10) + (1150)(3/10) + (2200)(2/5) + (3150)(l/5) 

= $1,855. 

4.20 E\g(X)\ = E{e 2X / 3 ) = f 0 °° e 2x ^e~ x dx = f 0 °° e ~ x / 3 dx = 3. 

4.21 E(X 2 ) = 2x 2 (1 — x) dx = |. Therefore, the average profit per new automobile is 

(l/6)($5000.00) = $833.33. 

4.22 E(Y) = E(X + 4) = / 0 °° 32(x + 4)^ dx = 8 days. 

4.23 (a) E[g(X, Y)] = E(XY 2 ) =EE ^ 2 /(^, v) 

= (2)(1) 2 (0.10) + (2)(3) 2 (0.20 i ) + (2)(5) 2 (0.10) + (4)(1) 2 (0.15) + (4)(3) 2 (0.30) 

+ (4)(5) 2 (0.15) = 35.2. 

(b) p x = E{X) = (2) (0.40) + (4) (0.60) = 3.20, 

p Y = E(Y) = (1)(0.25) + (3)(0.50) + (5)(0.25) = 3.00. 

3 2 

4.24 (a) E(X 2 Y — 2XY) = £ E (x 2 y - 2xy)f(x, y) = (1 - 2) (18/70) + (4 - 4) (18/70) + 

x = 0 y = 0 

••• + (8-8)(3/70) = -3/7. 

,, . x 0 1 23 y 0 1 2 

^ ’ g(x) 5/70 30/70 30/70 5/70 h(y) 15/70 40/70 15/70 
px = E(X) = (0)(5/70) + (1) (30/70) + (2) (30/70) + (3) (5/70) = 3/2, 
p Y = E(Y) = (0) (15/70) + (1) (40/70) + (2) (15/70) = 1. 

3 3 

4.25 px+Y = E( X + Y) — E E ( x + y)f(x, y) — (0 + 0) (1/55) + (1 + 0) (6/ 55) + • • ■ + (0 + 

#=0 y = 0 

3) (1/55) = 2. 

4.26 E(Z) = E(VX* + Y*) = J Q ' 4xy^TV dx dy = § ^[y{l + y 2 f 2 - y 4 ] dy 
= 8(2 3 / 2 - 1)/15 = 0.9752. 

4.27 E( X) = 2 fjoQ / 0 °° xexp(— x/2000) dx = 2000 y exp(-y) dy = 2000. 

4.28 (a) The density function is shown next. 



23.75 26.25 
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(b) E(X) = f dx = |(26.25 2 - 23.75 2 ) = 25. 

(c) The mean is exactly in the middle of the interval. This should not be surprised 
due to the symmetry of the density at 25. 

4.29 (a) The density function is shown next 



(b) p = E(X) = ff° 3x 3 dx = §. 

4.30 E(Y) = i/ 0 °° ye~y/ 4 dy = 4. 

4.31 (a) p = E(Y) = 5 /q y(l — y ) 4 dy = - f* y d(l - yf = J 0 °°(l-y ) 5 dy = |. 

(b) P(Y > 1/6) = 5(1 - yf dy = - (1 - yf\\ /& = (1 - 1/6) 5 = 0.4019. 

4.32 (a) A histogram is shown next. 



1 2 3 4 5 

x 

(b) p = (0)(0.41) + (1)(0.37) + (2)(0.16) + (3)(0.05) + (4)(0.01) = 0.88. 

(c) E(X 2 ) = (0) 2 (0.41) + (1) 2 (0.37) + (2) 2 (0.16) + (3) 2 (0.05) + (4) 2 (0.01) = 1.62. 

(d) Var(X) = 1.62 - 0.88 2 = 0.8456. 

4.33 /j = $500. So, u 2 = E[(X - p) 2 } = - y) 2 f(x) = (-1500) 2 (0.7) + (3500) 2 (0.3) = 

$5,250,000. 


X 
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4.34 p = (— 2)(0.3) + (3)(0.2) + (5)(0.5) = 2.5 and 
E(X 2 ) = (— 2) 2 (0.3) + (3) 2 (0.2) + (5) 2 (0.5) = 15.5. 

So, ex 2 = E(X 2 ) — p 2 = 9.25 and a = 3.041. 

4.35 p = (2)(0.01) + (3)(0.25) + (4)(0.4) + (5)(0.3) + (6)(0.04) = 4.11, 

E(X 2 ) = (2) 2 (0.01) + (3) 2 (0.25) + (4) 2 (0.4) + (5) 2 (0.3) + (6) 2 (0.04) = 17.63. 

So, a 2 = 17.63-4.il 2 = 0.74. 

4.36 p = (0)(0.4) + (1)(0.3) + (2)(0.2) + (3)(0.1) = 1.0, 

and E(X 2 ) = (0) 2 (0.4) + (1) 2 (0.3) + (2) 2 (0.2) + (3) 2 (0.1) = 2.0. 

So, a 2 = 2.0 - 1.0 2 = 1.0. 

4.37 It is know p = 1/3. 

So, E( X 2 ) = 2x 2 (1 — x) dx = 1/6 and a 2 = 1/6 — (1/3) 2 = 1/18. So, in the actual 

profit, the variance is jg(5000) 2 . 

4.38 It is known p = 8/15. 

Since E( X 2 ) = f ( ‘ | x 2 (x + 2) dx = //, then a 2 = 11/30 — (8/15) 2 = 37/450. 

4.39 It is known p — 1. 

Since E(X 2 ) = f^x 2 dx + fi x 2 ( 2 — x) dx = 7/6, then a 2 = 7/6 — (l) 2 = 1/6. 

4.40 p g (x) = E[g(X)] = f*( 3x 2 + 4) dx = \ fo(6% 3 + 12a; 2 + 8a; + 16) dx = 5.1. 

So, cr 2 = E[g{ X) — p] 2 = Jq(3x 2 + 4 — 5.1) 2 (^-^) dx 

= f 0 \9x 4 - 6.6a; 2 + 1.21) (^±4) dx = 0.83. 

4.41 It is known p g (x) = E[{2X + l) 2 ] = 209. Hence 
<x) = E[(2* + l) 2 - 209] 2 «/(x) 

= (25 - 209) 2 (l/6) + (169 - 209) 2 (l/2) + (361 - 209) 2 (l/3) = 14, 144. 

So, a g(x) = = 118.9. 

4.42 It is known p g ( X ) = E(X 2 ) = 1/6. Hence 
a g(x) = Jo 2 ( x 2 - I) 2 (! - x ) dx = 7 /!80. 

4.43 p Y = E(3X - 2) = \ f 0 °°(3x - 2)e~ x/4 dx = 10. So 

cr 2 - = E{[(3X - 2) - 10] 2 } = f f 0 °°(x - 4) 2 e~ x / 4 dx = 144. 

4.44 E(XY) = EE^/(x,t/) = (1) (1) (18/70) + (2) (1) (18/70) 

+ (3) (1) (2/70) + (1) (2) (9/70) + (2) (2) (3/70) = 9/7; 

= xf(x, y) = (0)/(0, 1) + (0)/(0, 2) + (1)/(1, 0) H h (3)/(3, 1) = 3/2, 

x y 

and py — 1. 

So, = E(XY) - p x pY = 9/7 - (3/2) (1) = -3/14. 
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4.45 [I x = J2 x 9(.x) = 2.45, p Y = E ?/%) = 2.10, and 

E(XY) = EE^,t/) = (1)(0.05) + (2)(0.05) + (3)(0.10) + (2)(0.05) 

+ (4)(0.10) + (6) (0.35) + (3)(0) + (6)(0.20) + (9)(0.10) = 5.15. 

So, (7 xy = 5.15 - (2.45)(2.10) = 0.005. 


4.46 From previous exercise, k = (^) 10 4 , and g(x ) = k (20a; 2 + 98000 ) , with 
(ix = E(X) = xg(x) dx = k J 3 g° (20a; 3 + 98900 a.') dx = 40.8163. 
Similarly, (i Y = 40.8163. On the other hand, 

E(XY ) = k J 3 5 q 0 xy(x 2 + y 2 ) dy dx = 1665.3061. 

Hence, a xy = E(XY) - p x pv = 1665.3061 - (40.8163) 2 = -0.6642. 


4.47 g(x) = | | 0 * (x + 2 y) dy = |(a; + 1, for 0 < x < 1, so p x = § Jq x{x + 1) dx = |; 
%) = | fo( x + 2 v) dx =l{\ + 2y), so p Y = ! /q y (| + 2 y) dy = and 
E(XY ) = | Jq 1 au/(a; + 2y) dy dx — |. 

So, cr X y = E(XY) - p x p Y = | - (|) (y§) = -0.0062. 


4.43 




P = £2Q1 (Xy = 


be ri 


\/ a x b2ry2 x 

Hence p = 1 if 6 > 0 and p = — 1 if b < 0 


W\ = sl s n 


4.49 E(X) = (0)(0.41) + (1)(0.37) + (2)(0.16) + (3)(0.05) + (4)(0.01) = 0.88 

and E(X 2 ) = (0) 2 (0.41) + (1) 2 (0.37) + (2) 2 (0.16) + (3) 2 (0.05) + (4) 2 (0.01) = 1.62. 
So, Var(X) = 1.62 - 0.88 2 = 0.8456 and a = V0.8456 = 0.9196. 


4.50 


E(X) = 2 /g x(l - x) dx = 2 (4 - 
E(X 2 ) = 2 Jg 1 a; 2 (l - x) dx = 2 (*£ 
V ar(X) = | — (|) 2 = 2g, and a = 


= | and 


= jL Hence, 


y/l/18 = 0.2357. 


4.51 Previously we found p = 4.11 and a 2 = 0.74, Therefore, 

p g{X ) = E(3X - 2) = 3p - 2 = (3)(4.11) - 2 = 10.33 and a g(X ) = 9 a 2 

4.52 Previously we found p — 1 and a 2 = 1. Therefore, 

= £(5X + 3) = 5/i + 3 = (5)(1) + 3 = 8 and a g{x) = 25 a 2 = 25. 


6 . 66 . 


4.53 Let X = number of cartons sold and Y = profit. 

We can write Y = 1.65X + (0.90) (5 — X) — 6 = 0.75A" — 1.50. Now 

E{X) = (0) (1/15) + (1) (2/15) + (2) (2/ 15) + (3)(3/15) + (4)(4/15) + (5)(3/15) = 46/15, 

and E(Y) = (0.75 )E(X) - 1.50 = (0.75) (46/15) - 1.50 = $0.80. 

4.54 p x = E(X) = \ f 0 °° xe~ x/4 dx = 4. 

Therefore, p Y = E{3X — 2) = 3 E(X) — 2 = (3) (4) — 2 = 10. 

Since E(X 2 ) = / / 0 °° x 2 e~ x / 4 dx = 32, therefore, a\ = E(X 2 ) — p 2 x = 32 — 16 = 16. 
Hence o\ = 9 o 2 x = (9) (16) = 144. 
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4.55 E(X) = ( 3) (1/6) + (6) (1/2) + (9)(l/3) = 11/2, 

E(X 2 ) = (— 3) 2 (l/6) + (6) 2 (l/2) + (9) 2 (l/3) = 93/2. So, 

E[(2X + l) 2 ] = 4 E{X 2 ) + 4 E(X) + 1 = (4) (93/2) + (4) (11/2) + 1 = 209. 

4.56 Since E{ X) = f ( ' x 2 dx + x(2 — x) dx = 1, and 

E(X 2 ) = f* x 3 2 dx + jf x 2 (2 — x) dx = 7/6, then 

E(Y ) = 60P(A" 2 ) + 39P(X) = (60) (7/6) + (39) (1) = 109 kilowatt hours. 

4.57 The equations E[(X — l) 2 ] = 10 and E[( X — 2) 2 ] = 6 may be written in the form: 

E(X 2 ) - 2 E(X) = 9, E(X 2 ) - 4 E(X) = 2. 

Solving these two equations simultaneously we obtain 

E(X) = 7/2, and E(X 2 ) = 16. 

Hence /i = 7/2 and cr 2 = 16 — (7/2) 2 = 15/4. 

4.58 E(X) = (2) (0.40) + (4) (0.60) = 3.20, and 

E(Y) = (1)(0.25) + (3)(0.50) + (5)(0.25) = 3. So, 

(a) E(2X - 3 Y) = 2 E(X) - 3 E(Y) = (2) (3.20) - (3) (3.00) = -2.60. 

(b) E(XY) = E(X)E(Y) = (3.20) (3.00) = 9.60. 

4.59 E(2XY 2 - X 2 Y) = 2 E(XY 2 ) - E(X 2 Y). Now, 

E(XY 2 ) = E E xy 2 f(x,y) = (1)(1) 2 (3/14) = 3.14, and 

x=0 y = 0 

E{X 2 Y) =EE X 2 yf(x,y) = (l) 2 (1) (3/14) = 3.14. 

y = 0 

Therefore, E{2XY 2 - X 2 Y) = (2) (3/14) - (3/14) = 3/14. 

4.60 Using p = 60 and cr = 6 and Chebyshev’s theorem 

P(ji — kcr < X < p + kcr ) > 1 — — , 

rC 

since from p + kcr = 84 we obtain k — 4. 

So, P(X < 84) > P(36 < X < 84) > 1 - ^ = 0.9375. Therefore, 

P(X > 84) < 1 - 0.9375 = 0.0625. 

Since 1000(0.0625) = 62.5, we claim that at most 63 applicants would have a score as 
84 or higher. Since there will be 70 positions, the applicant will have the job. 

4.61 p = 900 hours and cr = 50 hours. Solving p — kcr = 700 we obtain k — 4. 

So, using Chebyshev’s theorem with P(p — 4cr < X < p + 4cr) > 1 — 1/4 2 = 0.9375, 
we obtain P(700 < X < 1100) > 0.9375. Therefore, P(X < 700) < 0.03125. 
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4.62 fa — - 52 and a = 6.5. Solving p + ka = 71.5 we obtain k — 3. So, 


P(p — 3a < X < p + 3er) > 1 


1 

32 


0.8889, 


which is 


P( 32.5 < X < 71.5) > 0.8889. 

we obtain P ( X > 71.5) < 1 ~ a ) 88h!> = 0.0556 using the symmetry. 


4.63 n = 500, p = 4.5 and a = 2.8733. Solving p + k[a / \/500) = 5 we obtain 

5-4.5 0.5 


k = 


= 3.8924. 


2.87333//500 0.1284 

So, P(4 < X < 5) > 1 - ^ = 0.9340. 

4.64 a\ = 4 2 x+4y - 3 = 4 a 2 x + 16 = (4) (5) + (16) (3) = 68. 


4.65 erf = cr 2 


—2X+4Y—3 = 4 4 + 164 - 16er xy = (4) (5) + (16) (3) - (16) (1) = 52. 


4.66 (a) P(6 < X < 18) = P[12 - (2) (3) < X < 12 + (2) (3)] > 1 - £ = f 

(b) P( 3 < X < 21) = P[12 - (3)(3) < X < 12 + (3)(3)] > 1 - £ = § 

4.67 (a) P( \X - 10| > 3) = 1 - P( \X - 10| < 3) 

= 1 - P[10 - (3/2) (2) < X < 10 + (3/2) (2)] < 1 - [l - 


(3/2)2 


5 

9’ 


21 

25' 


(b) P( \X — 10| < 3) = 1 — P( \X - 10| > 3) > 1 - j 

(c) P( 5 < X < 15) = P[10 - (5/2) (2) < X < 10 + (5/2)(2)] > 1 - ^ 

(d) P( \X — 10| > c) < 0.04 implies that P( \X — 10 1 < c) > 1 — 0.04 = 0.96. 

Solving 0.96 = 1 — p- we obtain k — 5. So, c = ka = (5) (2) = 10. 

4.68 p = E( X) = 6 Jq x 2 ( 1 — x) dx = 0.5, E( X 2 ) = 6 f ( ' x 3 (l — x) dx = 0.3, which imply 
o 2 = 0.3 — (0.5) 2 = 0.05 and a = 0.2236. Hence, 


P{H - 2a < X < fi + 2er) = P(0.5 - 0.4472 < X < 0.5 + 0.4472) 

m.9472 

= P(0.0528 < X < 0.9472) = 6 / x(l - x) dx = 0.9839, 

J 0.0528 

compared to a probability of at least 0.75 given by Chebyshev’s theorem. 

4.69 It is easy to see that the expectations of X and Y are both 3.5. So, 

(a) E(X + Y ) = E(X) + E(Y) = 3.5 + 3.5 = 7.0. 

(b) E(X - Y) = E{X) - E(Y) = 0. 
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(c) E(XY) = E(X)E(Y) = (3.5) (3.5) = 12.25. 

4.70 E(Z) = E(XY) = E{X)E{Y) = ff f 2 °° 16 xy(y/x 3 ) dx dy = 8/3. 

4.71 E\g(X, y)] = E(X/Y 3 + X 2 Y ) = E(X/Y 3 ) + E(X 2 Y). 

E(X/Y 3 ) = ff If 2x( g 2y) dxdy= f Jf + fi) dy = |f; 

E(X 2 Y)=ffff^±Mdxdy = lffy(i + f) dy = § §. 

Hence, E[g(X, Y)] = g + f§ = |. 

4.72 g x = iiY = 3-5. 4 = 4 = [(l) 2 + (2) 2 + • • • + (6) 2 ](l/6) - (3.5) 2 = ff. 

(a) o 2 x-y = 4 a\ + 4> = 

(b) <T X +3Y- 5 = 4 + 94 = 1 p- 

4.73 (a) g = \ ff x dx = 2.5, a 2 = E(X 2 ) — g 2 = jrx 2 ff x 2 dx — 2.5 2 = 2.08. 

So, a = y/a 2 = 1.44. 

(b) By Chebyshev’s theorem, 

P[2.5 - (2)(1.44) < X < 2.5 + (2)(1.44)] = P(-0.38 < X < 5.38) > 0.75. 

Using integration, P(— 0.38 < X < 5.38) = 1 > 0.75; 

P[2.5 - (3)(1.44) < X < 2.5 + (3)(1.44)] = P(-1.82 < X < 6.82) > 0.89. 

Using integration, P(— 1.82 < X < 6.82) = 1 > 0.89. 

4.74 P = I 2 R with R = 50, gj = E(I) = 15 and aj = Var(I) = 0.03. 

E(P) = E(P 2 R) = 50 E(I 2 ) = 50 [Var(I) + gj] = 50(0.03 + 15 2 ) = 11251.5. If we use 
the approximation formula, with g(I) = 1 2 , g'(I) = 21 and g"(I ) = 2, we obtain, 

2 " 

P(P)« 50 g(g T ) + 2°-±- = 50(15 2 + 0.03) = 11251.5. 

Since Var[g(I)] « a 2 , we obtain 

L - 1 1=1*1 

Var(P) = 50 2 Var(I 2 ) = 50 2 (2 gifoj = 50 2 (30) 2 (0.03) = 67500. 

OO OO 

4.75 For 0 < a < 1, since g(a) — Yf ° x — . r /( a ) = 4 xa x ~ 1 = and 

x=0 x=l 

00 

9 " (a) = E l)^- 2 = 

x=2 
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(a) E( X) = (3/4) E i(l/4)' = (3/4)(l/4) E>(l/4)'-‘ = (3/16)[l/(l - 1/4) 2 ] 

— 1/3, and E(Y) — E(X) — 1/3. 

oo 

E(X 2 ) - E(X) = E[X(X - 1)] = (3/4) £ x(x - l)(l/4)* 

x=2 

oo 

= (3/4) (1/4) 2 £ x(x - 1) (1/ 4) a:— 2 = (3/ 4 3 ) [2 / (1 - 1/4) 3 ] = 2/9. 

So, Var(X) = E(X 2 ) - [E(X)] 2 = [E(X 2 ) - E(X)] + E(X) - [E(X)f 
2/9 + 1/3 - (1/3) 2 = 4/9, and Var(Y ) = 4/9. 

(b) E(Z) = E(X) + E(Y) = (1/3) + (1/3) = 2/3, and 

Var(Z) = V ar{X + Y) = Var(X) + Var(Y ) = (4/9) + (4/9) = 8/9, since X and 
Y are independent (from Exercise 3.79). 

4.76 (a) g(x) = § / 0 ' (x 2 + y 2 ) dy = |(3a: 2 + 1) for 0 < x < 1 and 

/i(y) = |(3y 2 -r 1) for ()<//< 1. 

Since f(x,y ) ^ g(x)h(y), X and Y are not independent. 

(b) E(X + 1") = E(X) + E(Y) = 2 E(X) = x(3x 2 + 1) dx = 3/4 + 1/2 = 5/4. 

E(XY) = I /q 1 xy{x 2 + 2 / 2 ) dx dy= § y(\ + ^j dy 

= I [(I) (I) + (I) (1)] = §■ 

(c) Var(X) = E(X 2 ) - [E(X)} 2 = ± £ x 2 (3x 2 + 1) dx - (§) 2 = £ - f = gf and 
Ear(F) = gf Also, Cou(X,F) = E(XY) - E(X)E(Y) = § - (§) 2 = -g. 

(d) Uar(X + y) = Var(X) + Var(Y ) + 2C'on(X, F) = 2^ - 2 g = ||. 

4.77 (a) E(Y) = f 0 °° ye~ y/4 dy = 4. 

(b) E(Y 2 ) = f 0 °° y 2 e~ y / 4 dy = 32 and Var(Y ) = 32 - 4 2 = 16. 

4.78 (a) The density function is shown next. 




(b) E(Y) = fy y dy = i[8 2 - 7 2 ] = f = 7.5, 

E(Y 2 ) = ffy 2 dy = i[8 3 - 7 3 ] = if, and Var(Y ) = if - (f ) 2 = g. 

4.79 Using the exact formula, we have 



1884.32. 
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Using the approximation, since g(y ) = e y , so g"(y) = e y . Hence, using the approxima- 
tion formula, 


E(e Y ) 




+ e 


Mr 


(X 


y 



1883.38. 


The approximation is very close to the true value. 

4.80 Using the exact formula, E(Z 2 ) = f* e 2y dy — \ e 2y \\ = 3841753.12. Hence, 

Var(Z) = E(Z 2 ) - [E(Z)f = 291091.3. 


Using the approximation formula, we have 

(2)(7.5) 

Var(e Y ) = (e^ Y ) 2 Var(Y) = = 272418.11. 

The approximation is not so close to each other. One reason is that the first order 
approximation may not always be good enough. 

4.81 Define I\ = {x;| joy — y\ < ka} and 1-2 = {ay| \xi — y\ > ka}. Then 

a 2 = E[{ X - y) 2 ) = ^(x - y) 2 f(x) = ^ (u ~ k) 2 f{ x i) + ( x * ~ d) 2 f( x i) 

X Xi£h Xi&I 2 

> (Xi - y) 2 f(xi) > k 2 a 2 ^2 f( x i) = k 2 a 2 p(\x - y\ > ka), 

Xi£h Xi&h 

which implies 

P{\X -y\ > ka) < 

Hence, P(\X — y\ < ka) > 1 — p. 

4.82 E(XY) = fo xy(x+y) dx dy = E(X) = x(x+y) dx dy = -^ and E(Y) = -E. 
Therefore, a X Y = E(XY) - y x y Y = § - (^) 2 = -y^- 

4.83 E(Y' — Yi) = Jj-j 1 J Q y 2(r/ — x) dx dr/ = y 2 dy = Therefore, the average amount of 
kerosene left in the tank at the end of each day is (1/3)(1000) = 333 liters. 

4.84 (a) E( X) = f 0 °° §e~ x / 5 dx = 5. 

(b) E( X 2 ) = / 0 °° f e ~ x / 5 dx = 50, so Var(X) = 50 - 5 2 = 25, and a = 5. 

(c) E[( X + 5) 2 ] = E{[(X - 5) + 10] 2 } = E[(X - 5) 2 ] + 10 2 + 20 E(X - 5) 

= Var(X) + 100 = 125. 

4.85 E(XY) = 24 f^ y x 2 y 2 dx dy = 8 £ y 2 ( 1 - y) 3 dy = X 

y x = 24 f Q f Q y x 2 y dx dy = I and /iy = 24 f Q f Q y xy 2 dx dy = |. Therefore, 
= E(XY) - /iy/iy = ^ - (I ) 2 = 
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4.86 E(X + Y) = f* fj y 24(x + y)xy dx dy = §. 

4.87 (a) E{ X) = ff° ^L e -P 900 dx = 900 hours. 

(b) P(X 2 ) = / 0 °° ^e-*/ 900 dx = 1620000 hours 2 . 

(c) V ar(X ) = E( X 2 ) — [E( X)} 2 = 810000 hours 2 and a = 900 hours. 

4.88 It is known g(x) = |( x + 1), for 0 < x < 1, and h(y) = |(1 + 4 y), for 0 < y < 1. 

(a) p x = fo §x(x + 1) dx = | and p Y = fg §2/(1 + 4 y) dy = §§• 

(b) E[(X + Y)f 2] = \[E{X) + E(Y)} = 

4.89 C'ou(aX, 6F) = P[(aX - ap x )(bY - 6/i y )] = a&P[(X - p x ){Y - p Y )] = abCov{X , F). 

4.90 It is known p = 900 and a = 900. For k = 2, 

P(/j - 2a < X < p + 2a) = P(— 900 < X < 2700) > 0.75 
using Chebyshev’s theorem. On the other hand, 

P(p - 2a < X < p + 2a) = P(-900 < X < 2700) = 1 - e" 3 = 0.9502. 

For k = 3, Chebyshev’s theorem yields 

P(p-3a < X < p + 3a) = P(- 1800 < X < 3600) > 0.8889, 
while P(— 1800 < X < 3600) = 1 - e“ 4 = 0.9817. 

4.91 g(x) = fo ^ dy = Jf, for x > 2, with p x = ff° X dx = - §|“ = 4, 

h(y) = f 2 °° dx = - |||“ = 2 y, for 0 < y < 1, with p Y = f 2 y 2 = §, and 

E(XY) = f 2 °° fo dy dx = ™ ff° dx = §. Hence, 
a XY = E(XY ) — p x p Y = | — (4) (|) = 0. 

4.92 Since a\ Y = 1, = 5 and af = 3, we have p = axY = , 1 = 0.2582. 

’ x r °x*y y / ( 5 )( 3 ) 

4.93 (a) From Exercise 4.37, we have a 2 = 1/18, so a = 0.2357. 

(b) Also, p x = 1/3 from Exercise 4.12. So, 

P(p - 2a < X < p + 2a) = P[ 1/3 - (2)(0.2357) < X < 1/3 + (2)(0.2357)] 

,> 0.8047 

= P(0 < X < 0.8047) = / 2(1 - x) dx = 0.9619. 

Jo 

Using Chebyshev’s theorem, the probability of this event should be larger than 
0.75, which is true. 

(c) P(profit > $500) = P(X > 0.1) = f ' , 2(1 — x) = 0.81. 
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4.94 Since g(0)h(0) = (0.17)(0.23) ^ 0.10 = /( 0,0), X and Y are not independent. 

4.95 E(X) = (— 5000)(0.2) + (10000)(0.5) + (30000)(0.3) = $13,000. 

4.96 (a) f(x) = Q (0.15) x (0.85) 3 ~ x , for x = 0, 1, 2, 3. 


X 

0 

1 

2 

3 

f(x) 

0.614125 

0.325125 

0.057375 

0.003375 


(b) E(X) = 0.45. 

(c) E(X 2 ) = 0.585, so Var(X) = 0.585 - 0.45 2 = 0.3825. 

(d) P(X < 2) = 1 - P(X = 3) = 1 - 0.003375 = 0.996625. 

(e) 0.003375. 

(f) Yes. 

4.97 (a) E(X) = (-$15k)(0.05) + ($15k)(0.15) + ($25k)(0.30) + ($40k)(0.15) + ($50k)(0.10) + 

($100k)(0.05) + ($150k)(0.03) + ($200k)(0.02) = $33.5k. 

(b) E(X 2 ) = 2, 697, 500, 000 dollars 2 . So, a = y/E(X 2 ) - [E(X)} 2 = $39.689k. 

4.98 (a) E(X) = ^ /_ 5 ° 0 *(50 2 - * 2 ) dx = 0. 

(b) E(X 2 ) = ^3 x 2 (50 2 - x 2 ) dx = 500. 

(c) a = y/E(X 2 ) - [E(X )} 2 = V500 - 0 = 22.36. 

4.99 (a) The marginal density of X is 


Xi 

0 1 

2 3 

4 

fXi(x l) 

0.13 0.21 

0.31 0.23 

0.12 


(b) The marginal density of Y is 


x 2 

0 

12 3 

4 

fx 2 (x 2 ) 

0.10 

0.30 0.39 0.15 

0.06 


(c) Given X 2 = 3, the conditional density function of Y , is f(x i , 3)/0. 15. So 

Xi 0 12 3 4 

f y (xY -1 I X 1 X 

JX 2 \^ 2 ) 15 5 15 5 15 

(d) E(X i) = (0)(0.13) + (1)(0.21) + (2)(0.31) + (3)(0.23) + (4)(0.12) = 2. 


(e) E(X 2 ) = (0)(0.10) + (1)(0.30) + (2)(0.39) + (3)(0.15) + (4)(0.06) = 1.77. 

(f) E(X i|X 2 = 3) = (0) (i) + (1) (|) + (2) (i) + (3) (|) + (4) (i) = i = | = 1.2. 


(g) E(X 2 ) = (0) 2 (0.13) + (1) 2 (0.21) + (2) 2 (0.31) + (3) 2 (0.23) + (4) 2 (0.12) = 5.44. 
So, a Xl = V E ( X 1) ~ l E ( x i)} 2 = ^5.44 - 2 2 = y/tte = 1.2. 


4.100 (a) The marginal densities of A" and Y are, respectively, 
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X 

0 12 

y 

0 

1 

2 

g(x) 

0.2 0.32 0.48 

Kv) 

0.26 

0.35 

0.39 

conditional density of X given Y = 

2 is 





X 

0 12 

fx |y=2(^ 2) 

3 5 3D 

39 39 39 


(b) E(X) = (0)(0.2) + (1)(0.32) + (2)(0.48) = 1.28, 

E(X 2 ) = (0) 2 (0.2) + (1) 2 (0.32) + (2) 2 (0.48) = 2.24, and 
Var(X) = 2.24 - 1.28 2 = 0.6016. 

(c) E(X\Y = 2) = (1)| + (2)| = | and E(X 2 \Y = 2) = (1) 2 | + (2) 2 § = §>. So, 
Var(X) = 125 - (65) 2 = 650. = jo. 

va '\ yX ) 39 V 39 / 1521 117 ' 


4.101 The profit is 8A + 3 Y — 10 for each trip. So, we need to calculate the average of this 
quantity. The marginal densities of X and Y are, respectively, 


X 

0 1 

2 

y 

0 

1 2 

3 

4 

5 

g(x) 

0.34 0.32 

0.34 

Kv) 

0.05 

0.18 0.15 

0.27 

0.19 

0.16 


So, E(8X+3Y-10) = (8) [(1) (0.32) + (2) (0.34)] + (3) [(1) (0.18) + (2) (0.15) + (3) (0.27) + 
(4)(0.19) + (5)(0.16)] - 10 = $6.55. 


4.102 Using the approximation formula, Var(Y) 
have 


E k 
2=1 


dh(x i i X2,—,Xk) 
dxi 


we 


Xi=fii , l<i<k 


VariY) w ^ 


i=0 


Q e bo+biki+b2k2 \ - 

dbi ) 


O", 


[ = e 2 ^ +k ^ 1 +k ^\a 2 + kjal + k 2 2 a 2 ] 


bi=(3i, 0<i<2 


4.103 (a) E(Y) = 10 J, 1 y( 1 - yf dy = - y( 1 - y) 10 |; + J 0 '(l - y) 10 dy = 

(b) E( 1 - Y) = 1 - E(Y) = g. 

(c) Var(Z) = Var{ 1 - Y) = Var(Y) = E(Y 2 ) - [E(Y)} 2 = TT ^ = 0.006887. 



Chapter 5 


Some Discrete Probability 
Distributions 

5.1 This is a uniform distribution: f(x) — X, for x — 1, 2, . . . , 10. 

Therefore P(X <4 ) = f{x) = 

X=1 

5.2 Binomial distribution with n = 12 and p = 0.5. Hence 

P(X = 3) = P(X < 3) - P(X < 2) = 0.0730 - 0.0193 = 0.0537. 

5.3 p — ^ = 5.5, and a 2 = ^ = 8.25. 

X=1 X=1 

5.4 For n — 5 and p = 3/4, we have 

(a) P(X = 2) = @(3/4) 2 (l/4) 3 = 0.0879, 

3 

(b) P(X < 3) = £ 6(x; 5, 3/4) = 1 - P(X = 4) - P(X = 5) 

x = 0 

= 1 - (4) (3/4) 4 (l/4) 1 - (g) (3/4) 5 (l/4)° = 0.3672. 

5.5 We are considering a b(x; 20, 0.3). 

(a) P(X > 10) = 1 - P(X < 9) = 1 - 0.9520 = 0.0480. 

(b) P(X < 4) = 0.2375. 

(c) P ( X = 5) = 0.1789. This probability is not very small so this is not a rare event. 
Therefore, P = 0.30 is reasonable. 

5.6 For n — 6 and p — 1/2. 

(a) P( 2 < X < 5) = P(X < 5) - P(X < 1) = 0.9844 - 0.1094. 

(b) P(X < 3) = P(X < 2) = 0.3438. 

5.7 p = 0.7. 
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(a) For n = 10, P ( X < 5) = P( X < 4) = 0.0474. 

(b) For n = 20, P(X < 10) = P(X < 9) = 0.0171. 

5.8 For n — 8 and p = 0.6, we have 

(a) P(X = 3) = b{ 3; 8, 0.6) = P(X < 3) - P(X < 2) = 0.1737 - 0.0498 = 0.1239. 

(b) P(X > 5) = 1 - P(X < 4) = 1 - 0.4059 = 0.5941. 

5.9 For n = 15 and p = 0.25, we have 

(a) P( 3 < X < 6) = P(X < 6) - P(X < 2) = 0.9434 - 0.2361 = 0.7073. 

(b) P(X < 4) = P(X < 3) = 0.4613. 

(c) P(X > 5) = 1 - P(X < 5) = 1 - 0.8516 = 0.1484. 

5.10 From Table A.l with n = 12 and p = 0.7, we have 

(a) P( 7 < X < 9) = P(X < 9) - P(X < 6) = 0.7472 - 0.1178 = 0.6294. 

(b) P(X < 5) = 0.0386. 

(c) P(X > 8) = 1 - P(X < 7) = 1 - 0.2763 = 0.7237. 

5.11 From Table A.l with n = 7 and p = 0.9, we have 

P(X = 5) = P(X < 5) - P(X < 4) = 0.1497 - 0.0257 = 0.1240. 

5.12 From Table A.l with n — 9 and p = 0.25, we have P ( X < 4) = 0.8343. 

5.13 From Table A.l with n — 5 and p = 0.7, we have 
P(X > 3) = 1 - P(X < 2) = 1 - 0.1631 = 0.8369. 

5.14 (a) n = 4, P(X = 4) = 1 - 0.3439 = 0.6561. 

(b) Assuming the series went to the seventh game, the probability that the Bulls won 
3 of the first 6 games and then the seventh game is given by 

(0.9) 3 (0.1) 3 (0.9) = 0.0131. 

(c) The probability that the Bulls win is always 0.9. 

5.15 p — 0.4 and n = 5. 

(a) P(X = 0) = 0.0778. 

(b) P(X < 2) = P(X < 1) = 0.3370. 

(c) P(X > 3) = 1 - P(X < 3) = 1 - 0.9130 = 0.0870. 
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5.16 Probability of 2 or more of 4 engines operating when p = 0.6 is 

P{X > 2) = 1 - P(X < 1) = 0.8208, 
and the probability of 1 or more of 2 engines operating when p = 0.6 is 

P(X > 1) = 1 - P(X = 0) = 0.8400. 

The 2-engine plane has a slightly higher probability for a successful flight when p = 0.6. 

5.17 Since p — np — (5)(0.7) = 3.5 and o 2 = npq = (5)(0.7)(0.3) = 1.05 with a = 1.025. 
Then p ± 2a = 3.5 ± (2) (1.025) = 3.5 ± 2.050 or from 1.45 to 5.55. Therefore, at least 
3/4 of the time when 5 people are selected at random, anywhere from 2 to 5 are of the 
opinion that tranquilizers do not cure but only cover up the real problem. 

5.18 (a) p — np = (15)(0.25) = 3.75. 

(b) With k = 2 and a = J npq = v /(15)(0.25)(0.75) = 1.677, p±2a = 3.75 ± 3.354 
or from 0.396 to 7.104. 

5.19 Let Xi = number of times encountered green light with P(Green) = 0.35, 

X 2 = number of times encountered yellow light with P(Ycllow) = 0.05, and 
X3 = number of times encountered red light with P(Red) = 0.60. Then 

f{x I,x 2 ,x 3 ) = ( n Vo.35)* 1 (0.05)«(0.60)*>. 

\X l ,X 2 ,X 3 J 


5.20 (a) ( 2 5° 3 ) (0-225) 2 (0.544) 5 (0.231) 3 = 0.0749. 

(b) (JJ)(0.544) 10 (0.456)° = 0.0023. 

(c) ( 1 0 °)(0.225)°(0.775) 10 = 0.0782. 

5.21 Using the multinomial distribution with required probability is 


7 

0,0, 1,4, 2 


(0.02)(0.82) 4 (0.1) 2 


0.0095. 


5.22 Using the multinomial distribution, we have ( 5 * 1 )(l/2) 5 (l/4) 2 (l/4) = 21/256. 

5.23 Using the multinomial distribution, we have 

(3 3^ 2 )(°- 4 ) 3 (0.2) 3 (0.3)(0.2) 2 = 0.0077. 


5.24 p = 0.40 and n = 6, so P(X = 4) = P{X < 4 )-P(X < 3) = 0.9590-0.8208 = 0.1382. 

5.25 n = 20 and the probability of a defective is p = 0.10. So, P( X < 3) = 0.8670. 
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5.26 n — 8 and p = 0.60; 

(a) P(X = 6) = (®) (0.6) 6 (0.4) 2 = 0.2090. 

(b) P(X = 6) = P(X < 6) - P(X < 5) = 0.8936 - 0.6846 = 0.2090. 

5.27 n = 20 and p = 0.90; 

(a) P(X = 18) = P(X < 18) - P(X < 17) = 0.6083 - 0.3231 = 0.2852. 

(b) P(X > 15) = 1 - P(X < 14) = 1 - 0.0113 = 0.9887. 

(c) P(X < 18) = 0.6083. 


5.28 n = 20; 


(a) p = 0.20, P( X > x) < 0.5 and P( X < x) > 0.5 yields x — 4. 

(b) p = 0.80, P{Y > y) > 0.8 and P(Y < y) < 0.2 yields y = 14. 


5.29 Using the hypergeometric distribution, we get 

(' 12 V 40 ') 

(a) i W i = 0.3246. 

(b) 1 - ||| = 0.4496. 

5.30 P(X > 1) = 1 - P(X = 0) = 1 - h( 0; 15, 3, 6) = 1 - W = |- 

( 4 )( 5 ) 

5.31 Using the hypergeometric distribution, we get h(2;9, 6, 4) = [2 / g \ 4 ' = 

5.32 (a) Probability that all 4 hre = h( 4; 10,4, 7) = 1. 

2 

(b) Probability that at most 2 will not hre = )T) h(x ; 10,4,3) = ||. 

x=0 


5.33 h(x; 6, 3, 4) = , for x = 1, 2, 3. 

P( 2 < X < 3) = h( 2; 6, 3, 4) + h( 3; 6, 3, 4) 


5.34 h( 2; 9, 5, 4) 



10 
21 - 


4 

5 ‘ 


5.35 P(X <2) = J2 h{x] 50, 5, 10) = 0.9517. 

x=0 

5.36 (a) P(X = 0) = h( 0; 25, 3, 3) = 

(b) P(X = l) = /r(l;25,3,l) = |. 

5.37 (a) P(X = 0) = 5(0; 3, 3/25) = 0.6815. 
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3 

(b) P( 1 <X <3) = J2 b (x] 3, 1/25) = 0.1153. 

X=1 

5.38 Since p = (4) (3/10) = 1.2 and a 2 = (4) (3/10) (7/10) (6/9) = 504/900 with a = 0.7483, 
at least 3/4 of the time the number of defectives will fall in the interval 

p ± 2a = 1.2 ± (2) (0.7483), or from - 0.297 to 2.697, 

and at least 8/9 of the time the number of defectives will fall in the interval 

p ± 3a = 1.2 ± (3) (0.7483) or from - 1.045 to 3.445. 


5.39 Since p = (13)(13/52) = 3.25 and a 2 = (13) (1/4) (3/4) (39/51) = 1.864 with a = 1.365, 
at least 75% of the time the number of hearts lay between 

P it 2cr = 3.25 ± (2) (1.365) or from 0.52 to 5.98. 


5.40 The binomial approximation of the hypergeometric with p — 1 — 4000/10000 = 0.6 

7 

gives a probability of )T) b(x; 15,0.6) = 0.2131. 

x=0 

5.41 Using the binomial approximation of the hypergeometric with p = 0.5, the probability 

2 

is 1 - £ b(x; 10, 0.5) = 0.9453. 

x=0 

5.42 Using the binomial approximation of the hypergeometric distribution with p = 30/150 = 

2 

0.2, the probability is 1 — K x 'i 10,0.2) = 0.3222. 

x=0 

5.43 Using the binomial approximation of the hypergeometric distribution with 0.7, the 

13 

probability is 1 — )T) b(x; 18,0.7) = 0.6077. 

x=10 

5.44 Using the extension of the hypergeometric distribution the probability is 


?)(?)(?)© 



0.0129. 


5.45 (a) The extension of the hypergeometric distribution gives a probability 

QQQ (?) 4 

(?) 33 ' 


(b) Using the extension of the hypergeometric distribution, we have 

(DQQ , Q (?) CD , (?) 9 CD s 

(?) (?) (?) 165 ' 
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5.46 Using the extension of the hypergeometric distribution the probability is 


;)©© , ©00 , © 0 © 


© 


+ 


© 


+ 


0 


17 

63' 


nowiSN 

5.47 h( 5; 25, 15, 10) = 1 5 / 25 \ loJ = 0.2315. 

usj 


(a) 

(?)(?) 

= 0.4762 

(b) 

a)(?) 

= 0.0952 


(a) 

m\ 7 ) 

= 0.3991 

( 2 5 °) 

(b) 


= 0.1316 

IT) 


5.50 N = 10000, n = 30 and k = 300. Using binomial approximation to the hypergeometric 
distribution with p = 300/10000 = 0.03, the probability of > 1} can be determined 
by 


1 - 6(0; 30, 0.03) = 1 - (0.97) 30 = 0.599. 

5.51 Using the negative binomial distribution, the required probability is 

6* (10; 5, 0.3) = Q^(0.3) 5 (0.7) 5 = 0.0515. 

5.52 From the negative binomial distribution, we obtain 

6* (8; 2, 1/6) = Q (1/ 6) 2 (5/ 6) 6 = 0.0651. 

00 5 

5.53 (a) P(X > 5) = £ p(x- 5) = 1 - E P(U 5) = 0.3840. 

x=6 x=0 

(b) P(X = 0) = p(0; 5) = 0.0067. 

5.54 (a) Using the negative binomial distribution, we get 

6*(7;3,l/2)= Q(l/2) 7 = 0.1172. 

(b) From the geometric distribution, we have g( 4; 1/2) = (l/2)(l/2) 3 = 1/16. 
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5.55 The probability that all coins turn up the same is 1/4. Using the geometric distribution 
with p = 3/4 and q = 1/4, we have 

P( X < 4) = £ 0 ( z ; 3 / 4 ) = X>/4)(l/4r‘ = // 


5.56 (a) Using the geometric distribution, we have g( 5; 2/3) = (2/3)(l/3) 4 = 2/243. 
(b) Using the negative binomial distribution, we have 

6'(S;3,2/3) = F)(2/3) 3 (l/3) 2 = h 


5.57 Using the geometric distribution 

(a) P(X = 3) = <?(3; 0.7) = (0.7)(0.3) 2 = 0.0630. 

3 3 

(b) P(X <A)=J2 g(x; 0.7) = £ (0.7) (0.3)*- 1 = 0.9730. 

X=1 X=1 

5.58 (a) Using the Poisson distribution with x = 5 and g = 3, we find from Table A. 2 that 

5 4 

p( 5; 3) = 3) - J>(z; 3) = 0.1008. 

x=0 x=0 

(b) P(X < 3) = P(X < 2) = 0.4232. 

(c) P(X > 2) = 1 - P(X < 1) = 0.8009. 

5.59 (a) P(X > 4) = 1 - P(X < 3) = 0.1429. 

(b) P(X = 0) = p( 0; 2) = 0.1353. 

5.60 (a) P(X < 4) = P(X < 3) = 0.1512. 

(b) P { 6 < X < 8) = P(X < 8) - P(X < 5) = 0.4015. 

5.61 (a) Using the negative binomial distribution, we obtain 

b*( 6; 4, 0.8) = Q) (0.8) 4 (0.2) 2 = 0.1638. 

(b) From the geometric distribution, we have <7 (3; 0.8) = (0.8) (0.2) 2 = 0.032. 

5.62 (a) Using the Poisson distribution with g = 12, we find from Table A. 2 that 

P(X < 7) = P(X < 6) = 0.0458. 

(b) Using the binomial distribution with p = 0.0458, we get 

6(2; 3, 0.0458) = ffj (0.0458) 2 (0.9542) = 0.0060. 




5.66 (a) p — np — (1875) (0.004) = 7.5, so P(X < 5) = P(X < 4) « 0.1321. 

(b) P ( 8 < X < 10) = P(X < 10) - P(X < 7) « 0.8622 - 0.5246 = 0.3376. 

5.67 (a) p = (2000) (0.002) = 4 and a 2 = 4. 

(b) For k = 2, we have p ± 2cr = 4 ± 4 or from 0 to 8. 

5.68 (a) p = (10000) (0.001) = 10 and <r 2 = 10. 

(b) For k — 3, we have p ± 3cr = 10 ± 3\^10 or from 0.51 to 19.49. 

5.69 (a) P(X < 3 1 At = 5) = 0.2650. 

(b) P(X > 1 1 At = 5) = 1 - 0.0404 = 0.9596. 

5.70 (a) P(X = 4|A t = 6) = 0.2851 - 0.1512 = 0.1339. 

(b) P(X > 4|A t = 6) = 1 - 0.1512 = 0.8488. 

74 

(c) P(X > 75|A t = 72) = 1 - E P{x] 74) = 0.3773. 

2 = 0 

5.71 (a) P(X > 10| At = 14) = 1 - 0.1757 = 0.8243. 

(b) At = 14. 

5.72 p = np= (1875) (0.004) = 7.5. 

5.73 p = (4000)(0.001) = 4. 
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5.74 p = 1 and a 2 = 0.99. 

5.75 fi = \t= (1.5)(5) = 7.5 and P(X = 0|At = 7.5) = e~ 7 - 5 = 5.53 x 10~ 4 . 

5.76 (a) P(X < 1 1 At = 2) = 0.4060. 

(b) p = \t = (2)(5) = 10 and P(X < 4|A t = 10) = 0.0293. 

5.77 (a) P(X > 10| At = 5) = 1 - P(X < 10| At = 5) = 1 - 0.9863 = 0.0137. 

(b) p = X t = (5)(3) = 15, so P(X > 20| At = 15) = 1 - P(X < 20| A = 15) = 
1 - 0.9170 = 0.0830. 

5.78 p = 0.03 with a ^(a:; 0.03). So, P(X = 16) = (0.03)(1 - 0.03) 15 = 0.0190 and 

t jt= ok~ 1 = 32 - 33 - 

5.79 So, Let Y = number of shifts until it fails. Then Y follows a geometric distribution 
with p = 0.10. So, 

P(Y < 6) = <7(1; 0.1) + g( 2; 0.1) + • • • + g( 6; 0.1) 

= (0.1)[1 + (0.9) + (0.9) 2 + • • • + (0.9) 5 ] = 0.4686. 


5.80 (a) The number of people interviewed before the first refusal follows a geometric 

distribution with p = 0.2. So. 

00 (1 — 0 2 V 50 

P(X > 51) = (0.2)(1 - 0.2) 1 = (0.2) 1 b = 0.00001, 

x=51 ' ' ' 

which is a very rare event. 

(b) p = ^-1 = 4. 

5.81 n = 15 and p = 0.05. 

i 

(a) P(X > 2) = 1 - P(X < 1) = 1 - E ©(0-05) a; (l - 0.05) 15-1 = 1 - 0.8290 = 

x=0 

0.1710. 

(b) p = 0.07. So, P(X < 1) = E ( 1 a 5 )(0-07) a: (l - 0.07) 15-1 = 1 - 0.7168 = 0.2832. 

x=0 

5.82 n = 100 and p = 0.01. 

3 

(a) P(X > 3) = 1 - P(X < 3) = 1 - E ( 1 °°)(°- 01 ) x ( 1 - O.Ol) 100 "^ = 1 - 0.9816 = 

#=0 

0.0184. 

3 

(b) For p = 0.05, P(X < 3) = E (T) (0.05) x (l - O.OS) 100 " 1 = 0.2578. 

x=0 
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5.83 Using the extension of the hypergeometric distribution, the probability is 


© 


0.0308. 


5.84 A = 2.7 call/min. 

(a) P(X < 4) = E e ~ 2 ' 7 £- 7)X = 0.8629. 

x=0 

(b) P(X < 1) = E e ~ 2 ' 7 S.' 7)X = 0-2487. 

x=0 

(c) A t = 13.5. So, 

10 -13 5 / 

P(X > 10) = 1 - P(X < 10) = 1 - E £ — ' j, 13,5)T = 1 - 0.2971 = 0.7129. 

x=0 

5.85 n = 15 and p = 0.05. 

(a) P(X = 5) = ( 1 5 5 )(0.05) 5 (1 - 0.05) 10 = 0.000562. 

(b) I would not believe the claim of 5% defective. 

5.86 A = 0.2, so A t = (0.2)(5) = 1. 

(a) P(X < 1) = E £Z yr n = 2e_1 = 0.7358. Hence, P(X > 1) = 1-0.7358 = 0.2642. 

x=0 

(b) A = 0.25, so At = 1.25. P(X\lel) = E e ~ 1 ' 25 ;r ( ! L25)3! = 0.6446. 

x=0 

1 

5.87 (a) 1 - P(X < 1) = 1 - E ( 1 °°) (0.01) a; (0.99) 100_3; = 1 - 0.7358 = 0.2642. 

x=0 

(b) P(X < 1) = E ( 1 °°) (0.05) x (0.95) lot)-3: = 0.0371. 

x=0 

5.88 (a) 100 visits/60 minutes with At = 5 visits/3 minutes. P( X = 0) = e J, 5 ” = 0.0067. 

5 

(b) P(X > 5) = 1 — E = 1 — 0-6160 = 0.3840. 

x=0 

5.89 (a) P(X > 1) = 1 - P(X = 0) = 1 - Q (|)° (|) 4 = 1 - 0.4822 = 0.5177. 

(b) P(X > 1) = 1 - P(X = 0) = 1 - ( 2 0 4 ) (i)° (f ) 24 = 1 - 0.5086 = 0.4914. 

5.90 n = 5 and p = 0.4; P(X > 3) = 1 - P(X < 2) = 1 - 0.6826 = 0.3174. 

5.91 (a) p — bp — (200)(0.03) = 6. 

(b) cr 2 = npq = 5.82. 
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(c) P( X = 0) = - = 0.0025 (using the Poisson approximation). 

P( X = 0) = (0.97) 200 = 0.0023 (using the binomial distribution). 

5.92 (a) p 10 q° = (0.99) 10 = 0.9044. 

(b) p 10 q 12 - 10 = (0.99) 10 (0.01) 2 = (0.9044)(0.0001) = 0.00009. 

5.93 n = 75 with p = 0.999. 

(a) X = the number of trials, and P( X = 75) = (0.999) 75 (0.001)° = 0.9277. 

(b) Y = the number of trials before the first failure (geometric distribution), and 
P(Y = 20) = (0.001)(0.999) 19 = 0.000981. 

(c) 1 - P(no failures) = 1 - (0.001)°(0.999) 10 = 0.01. 

5.94 (a) (™)pq 9 = (10)(0.25)(0.75) 9 = 0.1877. 

(b) Let X be the number of drills until the first success. X follows a geometric 
distribution with p = 0.25. So, the probability of having the first 10 drills being 
failure is q 10 = (0.75) 10 = 0.056. So, there is a small prospects for bankruptcy. 
Also, the probability that the first success appears in the 11th drill is pq 10 = 0.014 
which is even smaller. 

5.95 It is a negative binomial distribution. (*~ \)p k q x ~ k = (jjl}) (0.25) 2 (0.75) 4 = 0.0989. 

5.96 It is a negative binomial distribution. 1 1 )p k q x ~ k = Ql}) (0.5) 2 (0.5) 2 = 0.1875. 

5.97 n = 1000 and p = 0.01, with p = (1000) (0.01) = 10. P(X < 7) = P(X < 6) = 0.1301. 

5.98 n = 500; 

(a) If p — 0.01, 


P(X > 15) 


1 - P(X < 14) 



(0.01) a; (0.99) 500_a; 


0 . 00021 . 


This is a very rare probability and thus the original claim that p = 0.01 is ques- 
tionable. 


(b) P(X = 3) = ( 5 °°)(0.01) 3 (0.99) 497 = 0.1402. 

(c) For (a), if p — 0.01, p = (500)(0.01) = 5. So 


P{X > 15) = 1 - P(X < 14) = 1 - 0.9998 = 0.0002. 


For (b), 


P(X = 3) = 0.2650 - 0.1247 = 0.1403. 


5.99 N = 50 and n = 10. 
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( 2 \( 48 \ 

(a) k = 2; P(X > 1) = 1 - P(X = 0) = 1 - = 1 - 0.6367 = 0.3633. 

uoj 

(b) Even though the lot contains 2 defectives, the probability of reject the lot is not 
very high. Perhaps more items should be sampled. 

(c) p = (10)(2/50) = 0.4. 

5.100 Suppose n items need to be sampled. P{ X > 1) = 1 — ^ = 1 — > 0.9. 

The solution is n = 34. 

5.101 Define X = number of screens will detect. Then X ~ b(x‘, 3,0.8). 

(a) P(X = 0) = (1 - 0.8) 3 = 0.008. 

(b) P(X = 1) = (3)(0.2) 2 (0.8) = 0.096. 

(c) P(X > 2) = P(X = 2) + P(X = 3) = (3)(0.8) 2 (0.2) + (0.8) 3 = 0.896. 

5.102 (a) P(X = 0) = (1 - 0.8) n < 0.0001 implies that n > 6. 

(b) (1 — p) 3 < 0.0001 implies p > 0.9536. 

5.103 n = 10 and p = ^ = 0.04. 

P(X > 1) = 1 - P(X = 0) « 1 - (0.04)°(1 - 0.04) 10 = 1 - 0.6648 = 0.3351. 

The approximation is not that good due to = 0.2 is too large. 

5.104 (a) P = SP = 0.1. 

(b) P = ^ = 0.2. 

5.105 n = 200 with p = 0.00001. 

4 

(a) P(X > 5) = 1 - P(X < 4) = 1 - £ ( 2 “)(0.00001) :r (l - O.OOOOl) 200 ^ « 0. This 

x=0 

is a rare event. Therefore, the claim does not seem right. 

(b) p — np — (200) (0.00001) = 0.02. Using Poisson approximation, 

P(X > 5) = 1 - P(X < 4) w 1 - e" 0 - 02 ^’^ = 0. 
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6.1 (a) Area=0.9236. 

(b) Area=l - 0.1867 = 0.8133. 

(c) Area=0.2578 - 0.0154 = 0.2424. 

(d) Area=0.0823. 

(e) Area=l - 0.9750 = 0.0250. 

(f) Area=0.9591 - 0.3156 = 0.6435. 

6.2 (a) The area to the left of z is 1 — 0.3622 = 0.6378 which is closer to the tabled value 

0.6368 than to 0.6406. Therefore, we choose z = 0.35. 

(b) From Table A. 3, z = —1.21. 

(c) The total area to the left of z is 0.5000+0.4838=0.9838. Therefore, from Table 
A. 3, z = 2.14. 

(d) The distribution contains an area of 0.025 to the left of — z and therefore a total 

area of 0.025+0.95=0.975 to the left of z. From Table A. 3, z = 1.96. 

6.3 (a) From Table A. 3, k = —1.72. 

(b) Since P(Z > k) = 0.2946, then P(Z < k) = 0.7054/ From Table A. 3, we find 
k = 0.54. 

(c) The area to the left of z — —0.93 is found from Table A. 3 to be 0.1762. Therefore, 
the total area to the left of k is 0.1762+0.7235=0.8997, and hence k = 1.28. 

6.4 (a) z = (17 - 30) / 6 = -2.17. Area=l - 0.0150 = 0.9850. 

(b) z = (22 - 30) / 6 = -1.33. Area=0.0918. 

(c) zi = (32 — 3)/6 = 0.33, z 2 = (41-30)/6 = 1.83. Area = 0.9664-0.6293 = 0.3371. 

(d) z = 0.84. Therefore, x = 30 + (6) (0.84) = 35.04. 
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(e) z\ = —1.15, Z 2 = 1.15. Therefore, x\ = 30 + (6)(— 1.15) = 23.1 and x 2 = 
30 + (6)(1.15) = 36.9. 

6.5 (a) z = ( 15- 18)/2.5 = -1.2; P(X < 15) = P(Z < -1.2) = 0.1151. 

(b) * = -0.76, k = (2.5)(— 0.76) + 18 = 16.1. 

(c) z = 0.91, k = (2.5)(0.91) + 18 = 20.275. 

(d) Z! = (17 - 18)/2.5 = -0.4, z 2 = (21 - 18)/2.5 = 1.2; 

P(17 < X < 21) = P(— 0.4 < Z < 1.2) = 0.8849 - 0.3446 = 0.5403. 

6.6 zi = [(p — 3 o) — fi]/a = —3, Z 2 = [(/ a + 3 a) — p\/o = 3; 

P(p-3o < Z < /i + 3o) = P(— 3 <Z < 3) = 0.9987 - 0.0013 = 0.9974. 

6.7 (a) z = ( 32 - 40)/6.3 = -1.27; P(X > 32) = P(Z > -1.27) = 1 - 0.1020 = 0.8980. 

(b) z = ( 28- 40)/6.3 = -1.90, P(X < 28) = P(Z < -1.90) = 0.0287. 

(c) Z! = (37 - 40)/6.3 = -0.48, £ 2 = (49 - 40)/6.3 = 1.43; 

So, P( 37 < X < 49) = P(— 0.48 < Z < 1.43) = 0.9236 - 0.3156 = 0.6080. 

6.8 (a) z = (31.7 - 30)/2 = 0.85; P(X > 31.7) = P(Z > 0.85) = 0.1977. 

Therefore, 19.77% of the loaves are longer than 31.7 centimeters. 

(b) Z\ = (29.3 - 30) / 2 = -0.35, z 2 = (33.5 - 30)/2 = 1.75; 

P(29.3 < X < 33.5) = P(— 0.35 < Z < 1.75) = 0.9599 - 0.3632 = 0.5967. 
Therefore, 59.67% of the loaves are between 29.3 and 33.5 centimeters in length. 

(c) z = (25.5 - 30) / 2 = -2.25; P(X < 25.5) = P(Z < -2.25) = 0.0122. 

Therefore, 1.22% of the loaves are shorter than 25.5 centimeters in length. 

6.9 (a) z = (224 — 200)/15 = 1.6. Fraction of the enps containing more than 224 mil- 

limeters is P(Z > 1.6) = 0.0548. 

(b) Z\ = (191 - 200)/15 = -0.6, Z 2 = (209 - 200)/15 = 0.6; 

P(191 < X < 209) = P(— 0.6 < Z < 0.6) = 0.7257 - 0.2743 = 0.4514. 

(c) z = (230 - 200)/15 = 2.0; P(X > 230) = P(Z > 2.0) = 0.0228. Therefore, 
(1000) (0.0228) = 22.8 or approximately 23 cups will overflow. 

(d) z = -0.67, x = (15) (-0.67) + 200 = 189.95 millimeters. 

6.10 (a) z = (10.075 - 10.000)/0.03 = 2.5; P(X > 10.075) = P(Z > 2.5) = 0.0062. 

Therefore, 0.62% of the rings have inside diameters exceeding 10.075 cm. 

(b) zi = (9.97 - 10) /0.03 = -1.0, ^ 2 = (10.03 - 10)/0.03 = 1.0; 

P(9.97 < X < 10.03) = P(-1.0 < Z < 1.0) = 0.8413 - 0.1587 = 0.6826. 

(c) z = -1.04, x = 10 + (0.03)(— 1.04) = 9.969 cm. 

6.11 (a) z = (30 - 24)/3.8 = 1.58; P(X > 30) = P(Z > 1.58) = 0.0571. 

(b) z = (15 - 24)/3.8 = -2.37; P(X > 15) = P(Z > -2.37) = 0.9911. He is late 
99.11% of the time. 
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(c) z={ 25 - 24)/3.8 = 0.26; P(X > 25) = P(Z > 0.26) = 0.3974. 

(d) z = 1.04, x = (3.8) (1.04) + 24 = 27.952 minutes. 

(e) Using the binomial distribution with p = 0.0571, we get 
6(2; 3, 0.0571) = g) (0.0571) 2 (0.9429) = 0.0092. 

6.12 [i = 99.61 and o = 0.08. 

(a) P(99.5 < X < 99.7) = P(-1.375 < Z < 1.125) = 0.8697 - 0.08455 = 0.7852. 

(b) P(Z > 1.645) = 0.05; x = (1.645)(0.08) + 99.61 = 99.74. 

6.13 z = -1.88, x = (2)(— 1.88) + 10 = 6.24 years. 

6.14 (a) z = (159.75 - 174.5)/6.9 = -2.14; P(X < 159.75) = P(Z < -2.14) = 0.0162. 

Therefore, (1000) (0.0162) = 16 students. 

(b) zi = (171.25 - 174.5)/6.9 = -0.47, z 2 = (182.25 - 174.5)/6.9 = 1.12. 

P(171.25 < X < 182.25) = P(-0.47 < Z < 1.12) = 0.8686 - 0.3192 = 0.5494. 
Therefore, (1000) (0.5494) = 549 students. 

(c) zi = (174.75 - 174.5)/6.9 = 0.04, z 2 = (175.25 - 174.5)/6.9 = 0.11. 

P(174.75 < X < 175.25) = P(0.04 < Z < 0.11) = 0.5438 - 0.5160 = 0.0278. 
Therefore, (1000)(0.0278)=28 students. 

(d) z = (187.75 - 174.5)/6.9 = 1.92; P(X > 187.75) = P(Z > 1.92) = 0.0274. 
Therefore, (1000) (0.0274) = 27 students. 

6.15 /i = $15.90 and a = $1.50. 

(a) 51%, since P(13.75 < X < 16.22) = P ( 147 U_ 1 M < ^ < 16 . 225 - 15 . 9 ) 

= P(-1.437 < Z < 0.217) = 0.5871 - 0.0749 = 0.5122. 

(b) $18.36, since P(Z > 1.645) = 0.05; x = (1.645)(1.50) + 15.90 + 0.005 = 18.37. 

6.16 (a) z = (9.55 — 8)/0.9 = 1.72. Fraction of poodles weighing over 9.5 kilograms = 

P(X > 9.55) = P(Z > 1.72) = 0.0427. 

(b) z = (8.65 — 8)/0.9 = 0.72. Fraction of poodles weighing at most 8.6 kilograms = 
P(X < 8.65) = P(Z < 0.72) = 0.7642. 

(c) Z\ = (7.25 - 8)/0.9 = -0.83 and ^ 2 = (9.15 - 8)/0.9 = 1.28. 

Fraction of poodles weighing between 7.3 and 9.1 kilograms inclusive 

= P(7.25 < X < 9.15) = P(— 0.83 < Z < 1.28) = 0.8997 - 0.2033 = 0.6964. 

6.17 (a) z = (10,175 — 10, 000)/100 = 1.75. Proportion of components exceeding 10.150 

kilograms in tensile strength^ P(X > 10, 175) = P(Z > 1.75) = 0.0401. 

(b) zi = (9,775 - 10,000)/100 = -2.25 and z 2 = (10,225 - 10, 000)/100 = 2.25. 
Proportion of components scrapped= P( X < 9, 775) + P( X > 10, 225) = P(Z < 
-2.25) + P(Z > 2.25) = 2 P(Z < -2.25) = 0.0244. 
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6.18 (a) x\ — p + 1.3cr and X 2 = p — 1.3a. Then z\ = 1.3 and Z 2 = —1.3. P{ X > 

p+1.3 a)+P(X < 1.3a) = P(Z > 1.3 ) + P(Z < -1.3) = 2 P(Z < -1.3) = 0.1936. 
Therefore, 19.36%. 

(b) x\ = p+ 0.52a and X 2 = p— 0.52cr. Then zi = 0.52 and z 2 = —0.52. P(p— 0.52a < 
X < p + 0.52a) = P(— 0.52 < Z < 0.52) = 0.6985 - 0.3015 = 0.3970. Therefore, 
39.70%. 

6.19 z = (94.5 - 115)/12 = -1.71; P(X < 94.5) = P(Z < -1.71) = 0.0436. Therefore, 
(0.0436) (600) = 26 students will be rejected. 

6.20 f(x) = for A < x < B. 


(a) fi = \l~^ydx 


B 2 -A 2 _ A+B 
2 (B-A) 2 ' 


(b) E( A' 2 ) = /f 


B-A 


So, a 2 


B 3 -A 3 
3 (B-A) 


dx = 

(4±fi 


B 3 -A 3 
3 (B-A)' 

\ 2 4(B 2 +AB+A 2 )-3(B 2 +2AB+A 2 ) _ B 2 -2AB+A 2 _ (B-A) 2 

) ~ 12 — 12 — 12 


6.21 A = 7 and B = 10. 

(a) P(X < 8.8) = = 0.60. 

(b) P(7.4 < X < 9.5) = = 0.70. 

(c) P(X > 8.5) = = o.5o. 

6.22 (a) P(X > 7) = ^ = 0.3. 

(b) P(2 < X < 7) = = 0.5. 

6.23 (a) From Table A.l with n = 15 and p = 0.2 we have 

4 

P(1 < x < 4) = E b{x; 15, 0.2) - 6(0; 15, 0.2) = 0.8358 - 0.0352 = 0.8006. 

x=0 

(b) By the normal-curve approximation we first find 

H — rip — 3 and then a 2 = npq = (15) (0.2) (0.8) = 2.4. Then a = 1.549. 

Now, z 1 = (0.5 - 3)/ 1 .549 = -1.61 and z 2 = (4.5 - 3)/1.549 = 0.97. 

Therefore, P(1 < X < 4) = P(-1.61 < Z < 0.97) = 0.8340 - 0.0537 = 0.7803. 

6.24 p — np — (400) (1/2) = 200, a = y/rvpq = v /(400)(l/2)(l/2) = 10. 


(a) z i = (184.5 - 200)/10 = -1.55 and = (210.5 - 200)/10 = 1.05. 

P(184.5 < X < 210.5) = P(— 1.55 < Z < 1.05) = 0.8531 - 0.0606 = 0.7925. 

(b) zi = (204.5 - 200)/10 = 0.45 and ^ 2 = (205.5 - 200)/10 = 0.55. 

P(204.5 < X < 205.5) = P(0.45 < Z < 0.55) = 0.7088 - 0.6736 = 0.0352. 

(c) zi = (175.5 - 200)/10 = -2.45 and ^ = (227.5 - 200)/10 = 2.75. 

P(X < 175.5) + P(X > 227.5) = P(Z < -2.45) + P(Z > 2.75) 

= P(Z < -2.45) + 1 - P(Z < 2.75) = 0.0071 + 1 - 0.9970 = 0.0101. 
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6.25 n = 100. 

(a) p = 0.01 with p = (100) (0.01) = 1 and a = ^/(lOO) (0.01) (0.99) = 0.995. 

So, z = (0.5 - 1) /0.995 = -0.503. P(X < 0) « P(Z < -0.503) = 0.3085. 

(b) p = 0.05 with p = (100) (0.05) = 5 and = ^(100) (0.05) (0.95) = 2.1794. 

So, z = (0.5 - 5)/2.1794 = -2.06. P(X < 0) « P(X < -2.06) = 0.0197. 

6.26 p — np — (100) (0.1) = 10 and = ^(100)(0.1)(0.9) = 3. 

(a) z = (13.5 - 10)/3 = 1.17; P(X > 13.5) = P(Z > 1.17) = 0.1210. 

(b) z = (7.5 - 10) / 3 = -0.83; P(X < 7.5) = P(Z < -0.83) = 0.2033. 

6.27 p = (100) (0.9) = 90 and a = ^(100)(0.9)(0.1) = 3. 

(a) zi = (83.5 - 90) / 3 = -2.17 and = (95.5 - 90)/3 = 1.83. 

P(83.5 < X < 95.5) = P(— 2.17 < Z < 1.83) = 0.9664 - 0.0150 = 0.9514. 

(b) z = (85.5 - 90) / 3 = -1.50; P(X < 85.5) = P(Z < -1.50) = 0.0668. 

6.28 p = (80) (3/4) = 60 and a = ■ N /(80)(3/4)(l/4) = 3.873. 

(a) z = (49.5 - 60)/3.873 = -2.71; P(X > 49.5) = P(Z > -2.71) = 1 - 0.0034 = 
0.9966. 

(b) z = (56.5 - 60)/3.873 = -0.90; P(X < 56.5) = P(Z < -0.90) = 0.1841. 

6.29 p = (1000) (0.2) = 200 and a = ^/(lOOO) (0.2) (0.8) = 12.649. 

(a) zi = (169.5 - 200)/12.649 = -2.41 and = (185.5 - 200)/12.649 = -1.15. 
P(169.5 < X < 185.5) = P(— 2.41 < Z < -1.15) = 0.1251 - 0.0080 = 0.1171. 

(b) Z\ = (209.5 - 200)/12.649 = 0.75 and ^ 2 = (225.5 - 200)/12.649 = 2.02. 

P(209.5 < X < 225.5) = P(0.75 < Z < 2.02) = 0.9783 - 0.7734 = 0.2049. 

6.30 (a) p = (100) (0.8) = 80 and a = ^(100)(0.8)(0.2) = 4 with z = (74.5 - 80)/4 = 

-1.38. 

P(Claim is rejected when p = 0.8) = P(Z < —1.38) = 0.0838. 

(b) p = (100) (0.7) = 70 and cr = ^(100) (0.7) (0.3) = 4.583 with z = (74.5 - 
70)/4.583 = 0.98. 

P(Claim is accepted when p = 0.7) = P(Z > 0.98) = 1 — 0.8365 = 0.1635. 

6.31 p = (180) (1/6) = 30 and a = ^(180) (1/6) (5/6) = 5 with z = (35.5 - 30)/5 = 1.1. 
P(X > 35.5) = P(Z > 1.1) = 1 - 0.8643 = 0.1357. 

6.32 p = (200) (0.05) = 10 and a = ^(200) (0.05) (0.95) = 3.082 with 

z = (9.5 - 10)/3.082 = -0.16. P(X < 10) = P(Z < -0.16) = 0.4364. 


6.33 p = (400) (1/10) = 40 and cr = v /(400)(l/10)(9/10) = 6. 
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(a) 2 = (31.5 - 40) / 6 = -1.42; P(X < 31.5) = P(Z < -1.42) = 0.0778. 

(b) 2 = (49.5 - 40)/6 = 1.58; P(X > 49.5) = P(Z > 1.58) = 1 - 0.9429 = 0.0571. 

(c) z 1 = (34.5 - 40)/6 = -0.92 and ^ 2 = (46.5 - 40)/6 = 1.08; 

P(34.5 < X < 46.5) = P(— 0.92 < Z < 1.08) = 0.8599 - 0.1788 = 0.6811. 

6.34 p = (180) (1/6) = 30 and cr = ^(180) (1/6) (5/6) = 5. 

(a) 2 = (24.5 - 30) / 5 = -1.1; P(X > 24.5) = P(Z > -1.1) = 1 - 0.1357 = 0.8643. 

(b) 2 i = (32.5 - 30) / 5 = 0.5 and z 2 = (41.5 - 30)/5 = 2.3. 

P( 32.5 < X < 41.5) = P(0.5 < Z < 2.3) = 0.9893 - 0.6915 = 0.2978. 

(c) z 1 = (29.5 - 30) / 5 = -0.1 and ^ 2 = (30.5 - 30)/5 = 0.1. 

P(29.5 < X < 30.5) = P(-0.1 < Z < 0.1) = 0.5398 - 0.4602 = 0.0796. 

6.35 (a) p = 0.05, n = 100 with p = 5 and a = ^(100) (0.05) (0.95) = 2.1794. 

So, z = (2.5 - 5)/2.1794 = -1.147; P(X > 2) « P(Z > -1.147) = 0.8749. 

(b) 2 = (10.5 - 5)/2.1794 = 2.524; P(X > 10) « P(Z > 2.52) = 0.0059. 

6.36 n = 200; X = The number of no shows with p = 0.02. 2 = 3 - 0 . 5 — i _ _q yg 

y/ ( 200 )( 0 . 02 )( 0 . 98 ) 

Therefore, P(airline overbooks the flight) = 1 — P( X > 3) ~ 1 — P(Z > —0.76) = 
0.2236. 

6.37 (a) P(X > 230) = P (Z > = 0.0228. 

(b) Denote by Y the number of students whose serum cholesterol level exceed 230 
among the 300. Then Y ~ b(y; 300, 0.0228 with p = (300)(0.0228) = 6.84 and 
a = n /( 300)(0.0228)(1 - 0.0228) = 2.5854. So, z = 8 ~ 9 0 5 5 ~^ 84 = 0.26 and 
P(X > 8) « P(Z > 0.26) = 0.3974. 

6.38 (a) Denote by X the number of failures among the 20. X ~ b{x ; 20, 0.01) and P ( X > 

1) = 1—6(0; 20, 0.01)— 6(1; 20, 0.01) = 1- ( 2 Q °) (0.01)°(0.99) 20 - ( 2 °) (0.01)(0.99) 19 = 
0.01686. 

(b) n = 500 and p = 0.01 with p = (500) (0.01) = 5 and a = ^(500) (0.01) (0.99) = 
2.2249. So, P(more than 8 failures) « P(Z > (8.5 - 5)/2.2249) = P(Z > 1.57) = 
1 - 0.9418 = 0.0582. 

6.39 P(1.8 < X < 2.4) = /, 2 8 4 xe~ x dx = [~xe~ x - = 2.8e.- L8 - 3.4e.- 2 - 4 = 0.1545. 

6.40 P(X > 9) = | / 9 °° x ~ x/ 3 dx = [-f e"^ 3 - e"^ 3 ] |“ = 4e" 3 = 0.1992. 

6.41 Setting a = 1/2 in the gamma distribution and integrating, we have 


1 

7mm 


x -i/2 e -x/p dx 


1 . 
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Substitute x = y 2 / 2, dx = y dy, to give 



since the quantity in parentheses represents one-half of the area under the normal curve 

n(y, 

6.42 (a) P(X < 1) = 4 f 3 xe~ 2x dx = [ -2xe~ 2x - e~ 2x }\l = 1 - 3e“ 2 = 0.5940. 

(b) P(X > 2) = 4 f 0 °° xe~ 2x dx = [-2xe~ 2x - e.- 2x ]|“ = 5e" 4 = 0.0916. 

6.43 (a) p = a(3 = (2) (3) = 6 million liters; a 2 = a(3 2 = (2) (9) = 18. 

(b) Water consumption on any given day has a probability of at least 3/4 of falling 
in the interval p ± 2a = 6 ± 2y/l8 or from —2.485 to 14.485. That is from 0 to 
14.485 million liters. 

6.44 (a) p = a/3 = 6 and a 2 = a/3 2 = 12. Substituting a = 6 / [3 into the variance formula 

we find 6/3 = 12 or (3 = 2 and then a = 3. 

(b) P(X > 12) = ± /” x 2 e x / 2 dx. Integrating by parts twice gives 

P(X > 12) = -L [-2z 2 e“ x/2 - 8xe~ x/2 - lbe"^ 2 ] = 25e" 6 = 0.0620. 


6.45 P(X <3) = i f 0 3 e~ x / 4 dx = -e~ x/4 \ 3 Q = 1 - e“ 3 / 4 = 0.5276. 

Let Y be the number of days a person is served in less than 3 minutes. Then 

P(Y > 4) = £ b (y ; 6, 1 - e -3 / 4 ) = Q (0.5276) 4 (0.4724) 2 + (®) (0.5276) 5 (0.4724) 

rr=4 

+ (®) (0.5276) 6 = 0.3968. 

6.46 P( X < 1) = \ J* e~ x ! 2 dx = — e - ^ 2 ]^ = 1 — e _1//2 = 0.3935. Let Y be the number 
of switches that fail during the first year. Using the normal approximation we find 
p = (100)(0.3935) = 39.35, a = \/(100)(0.3935)(0.6065) = 4.885, and z = (30.5 - 
39.35)/4.885 = -1.81. Therefore, P(Y < 30) = P(Z < -1.81) = 0.0352. 


6.47 


(a) E( X) = J 0 °° x 2 e x 2 / 2 dx = — xe x 2 / 2 

= 0 + ■ ±; e- a/2 dx = 1 

(b) P(A" > 2) = J 2 °° xe~ x2 l 2 dx = —e~ x2 ^ 2 


+ / 0 °° e * 2//2 dx 

= 1.2533. 

OO 

= e -2 = 0.1353. 

2 


6.48 p = U(T) = a/3 t ,3 e atP dt. Let y = at 13 , then dy = a^t 13 1 dt and t = {y/a) 1 ^ 3 . 

Then 


3 = 


{y/a) 4 ^e y dy = a l ^ 3 / yd+i/4) i e v dy — a 1 + 1//3). 
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/»oo /»oo /»oo 

E(T 2 ) = a/3 t^ +l e~ atl3 dt — {y/a) 2 ^ 3 e~ y dy = aT 2 ^ / yl 1+2 /C-^ e -y jy 

Jo Jo Jo 

= cU 2//3 r(l + 2//3). 

So, a 2 = E(T 2 ) - /i 2 = «- 2 ^{r(l + 2/(3 ) - [r(l + 1 /(3)} 2 }. 

6.49 R(t) = ce~ dt = ce~ 2 ^. However, R( 0) = 1 and hence c — 1. Now 

fit) = Z(t)R(t) = e~ 2y/i /Vt, t > 0, 

and 

OO 

P(T > 4) = / e- 2Vi /Vt dt = -e~ 2Vi = e~ 4 = 0.0183. 

J 4 4 

6.50 /(x) = 12x 2 (l — x), 0 < x < 1. Therefore, 

P{X > 0.8) = 12 [ x 2 (l - x) dx = 0.1808. 

J 0.8 

6.51 a = 5; (3= 10; 

(a) a(3 = 50. 

(b) o' 2 = a(3 2 = 500; so a = = 22.36. 

(c) P( X > 30) = f™ x Q - 1 e ~ x /0 dx. Using the incomplete gamma with y = 

x/ (3, then 

r 3 v 4 e~ y 

1 - P(X < 30) = 1 - P(Y < 3) = 1 - / J-— dy = 1 - 0.185 = 0.815. 

Jo r (5) 

6.52 a(3 = 10; a = y/afPy/W = 7.07. 

(a) Using integration by parts, 

i p50 i p50 

P(X < 50) = - — — - / x a ~ 1 e~ x ^ dx = — / xe~ x ^ dx = 0.9995. 

1 ' (3 a Y{a) J Q 25 J 0 
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(a) P(X > 5) = / 5 °° \e~ x E dx = \ [-Se"^ 3 ] = e" 5 / 3 = 0.1889. 

(b) P(X > 10) = e" 10 / 3 = 0.0357. 

6.54 P(X > 270) = 1 - $ ( In27 2 °~ 4 ) = 1 - $(0.7992) = 0.2119. 

6.55 p = E( X) = e 4+4 / 2 = e 6 ; <j 2 = e 8+4 (e 4 — 1) = e 12 (e 4 — 1). 


6.56 (3 = 1/5 and a = 10. 


(a) P(X > 10) = 1 - P(X < 10) = 1 - 0.9863 = 0.0137. 

(b) P ( X > 2) before 10 cars arrive. 


P{X < 2) 


f 2 1 

o ^ r(«) 


dx. 


Given y = x//3, then 


P{X < 2) = P(Y < 10) 



with P(X > 2) = 1 - P(X < 2) = 

6.57 (a) P(X > 1) = 1 - P(X < 1) = 1 - 
(b) p = (3= 1/10 = 0.1. 


y a - l e~ y , f 10 y^e-y , 

Ti W iy= L L nw dv = 

-- 1 - 0.542 = 0.458. 

10 fo e ~ Wx dx = e ~ W = 0-000045. 


0.542, 


6.58 Assume that Z(t) = af3tP~ l , for t > 0. Then we can write /(f) = Z(t)R(t), where 
R[t ) = ce _ ^ z 0) dt — ce~f a/3tl3 ldt = ce~ at>3 . From the condition that i?(0) = 1, we find 
that c—1. Hence 77(f) = e atP and /(f) = a/7f /3 ” 1 e _ “ t/3 , for f > 0. Since 


Z(t) 


m 

R(ty 


where 

77(f) = 1 - F(f) = 1 - [ af3x^- l e~ oxP dx = 1 + [ de~ ax0 = e~ at \ 

J o Jo 

then 


Z(«) 


aftt 13 4 e Qtl3 
e~ atP 


aftt 13 l , f > 0. 


6.59 p — np — (1000) (0.49) = 490, = ^Wpq = ^(1000) (0.49) (0.51) = 15.808. 


Zi = 


481.5 - 490 
15.808 


= -0.54, Z 2 = 


510.5-490 
15.808 


= 1.3. 


P(481.5 < X < 510.5) = P(— 0.54 < Z < 1.3) = 0.9032 - 0.2946 = 0.6086. 
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6.60 P(X > 1/4) = f™ 4 6e~ 6x dx = -e~ Qx \™ A = e~ h5 = 0.223. 

6.61 P( X < 1/2) = 108 J^ 2 x 2 e~ 6x dx. Letting y = Ox and using Table A. 24 we have 

P{X < 1/2) = P(Y < 3) = f y 2 e~ v dy = 0.577. 

Jo 


6.62 Manufacturer A: 


„ 100000 - 14000\ 

PiX > 10000 = P [ z> 

v - ’ 1 - 2000 ) 


= P(Z > -2) = 0.9772. 


Manufacturer B: 




= P(Z > -3) = 0.9987. 


Manufacturer B will produce the fewest number of defective rivets. 

6.63 Using the normal approximation to the binomial with p — np — 650 and cr = y/npq = 
15.0831. So, 


P(590 < X < 625) = P(— 10.64 < Z < -8.92) « 0. 


6.64 (a) // — (i — 100 hours. 

(b) P(X > 200) = 0.01 f™ 0 e-° 01x dx = e~ 2 = 0.1353. 

6.65 (a) p = 85 and o = 4. So, P(X < 80) = P(Z < -1.25) = 0.1056. 

(b) p = 79 and a = 4. So, P(X > 80) = P(Z > 0.25) = 0.4013. 

6.66 1//3 = 1/5 hours with a = 2 failures and (3 = 5 hours. 

(a) a(3 = (2)(5) = 10. 

(b) P ( X > 12) = r^p ( 2 ) xe ~ X//5 dx = A xe~ x / 5 dx = [— |e _a://5 — 

= 0.3084. 

6.67 Denote by X the elongation. We have p = 0.05 and a = 0.01. 

(a) P(X > 0.1) = P{Z> ai 0 7 ( /, 05 ) = P(Z > 5) « 0. 

(b) P(X < 0.04) = P(Z < °-°g-;- 05 ) = P(Z < -1) = 0.1587. 

(c) P(0.025 < X < 0.065) = P(-2.5 < Z < 1.5) = 0.9332 - 0.0062 = 0.9270. 

6.68 Let X be the error. X ~ n(x; 0,4). So, 

P (fails) = 1 - P(-10 < X < 10) = 1 - P(— 2.25 < Z < 2.25) = 2(0.0122) = 0.0244. 
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6.69 Let X be the time to bombing with p — 3 and cr = 0.5. Then 

P( 1 < X < 4) = P ( — - <Z< ^ ] = P (- 4 <Z< 2)= 0.9772. 

\ 0.5 0.5 ) 

P(of an undesirable product) is 1 — 0.9772 = 0.0228. Hence a product is undesirable 
is 2.28% of the time. 

6.70 a = 2 and (3 = 100. P( X < 200) = J? / 0 2 °° xe~ x ^ dx. Using the incomplete gamma 
table and let y — x/(3, ye~ v dy = 0.594. 

6.71 p = a/3 = 200 hours and a 2 = a/3 2 = 20, 000 hours. 

6.72 X follows a lognormal distribution. 

P(X > 50, 000) = 1 - $ ^ ln50 ’Q°°- = _ $(2.9099) = 1 - 0.9982 = 0.0018. 


6.73 The mean of X, which follows a lognormal distribution is p = E(X) 


— e M + o - 2 /2 


= e 7 . 


6.74 p — 10 and a = y/50. 

(a) P(X < 50) = P(Z < 5.66) « 1. 

(b) P(X < 10) = 0.5. 

(c) The results are very similar. 


6.75 


6.76 


(a) Since f(y) >0 and f* 10(1 — y) 9 dy = — (1 — ?/) 10 1 q = 1, it is a density function. 

(b) P(Y > 0.6) = - (1 - y) 10 \l 6 = (0.4) 10 = 0.0001. 


(c) a = 1 and (3 = 10. 

< d ) V- = ST? = IT = ° 0909 ' 


ct 2 = 


a (3 


( 1 )( 10 ) 


(a+/3) 2 (a+/3+l) (1+10) 2 (1+10+1) 


= 0.006887. 


(a) fJ. = ^ r ze z / 10 dz = - ze z / 10 |“ + f ( 

(b) Using integral by parts twice, we get 


*/ 10 dz 


10 . 


E(Z 2 ) 


1 

To 


z 2 e~ z/w dz 


200 . 


So, a 2 = E(Z 2 ) - p 2 = 200 - (10) 2 = 100. 
(c) P(Z > 10) = - e^ 10 |~ = e’ 1 = 0.3679. 


6.77 This is an exponential distribution with [3 = 10. 


(a) p = (3 = 10. 
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(b) a 2 = (3 2 = 100. 


6.78 // = 0.5 seconds and a = 0.4 seconds. 

(a) P(X > 0.3) = P(Z> pp) = P(Z > -0.5) = 0.6915. 

(b) P(Z > —1.645) = 0.95. So, —1.645 = pp yields x = —0.158 seconds. The 
negative number in reaction time is not reasonable. So, it means that the normal 
model may not be accurate enough. 


6.79 (a) For an exponential distribution with parameter /3, 


P{X > a + b\ X > a) 


P{X >a + b ) 
P(X > a) 



P(X > b). 


So, P(it will breakdown in the next 21 days | it just broke down) = P( X > 21) = 
e-2i/i5 = e -i.4 = 0.2466. 

(b) P(X > 30) = e- 30 / 15 = e~ 2 = 0.1353. 


6.80 a = 2 and (3 = 50. So, 


-10 


a^e" 2 * 50 dx. 


P(X < 10) = 100 [ 

Jo 

Let y = 2a; 50 with x = (y/ 2) 1 / 50 and dx = 9 i /5 d (50) ?/~ 49,/50 dy. 

100 


-( 2 ) 10 5 


P(X < 10) = 


2 1 / 50 (50) 


|) 49 / 5 %-^v» * = 


-( 2 ) 10 5 


e y dy « 1. 


6.81 The density function of a Weibull distribution is 

f{y) = a(3y^ l e~ a y\ y> 0. 


So, for any y > 0, 

F(y) = f f(t) dt — a(3 f t /3_1 e _a * d dt. 

Jo Jo 

Let z = t 13 which yields t = z 1 ^ 3 and dt = jj z 1 /P~ 1 dz. Hence, 

F(y) = a/3 f e ~™ dz = a T e" az dz = 1 - e~ a P 

Jo P Jo 

On the other hand, since de~ ay>3 = — a/3y^ 3 ~ 1 e~ ay>3 , the above result follows immedi- 
ately. 
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6.82 One of the basic assumptions for the exponential distribution centers around the “lack- 
of-memory” property for the associated Poisson distribution. Thus the drill bit of 
problem 6.80 is assumed to have no punishment through wear if the exponential dis- 
tribution applies. A drill bit is a mechanical part that certainly will have significant 
wear over time. Hence the exponential distribution would not apply. 

6.83 The chi-squared distribution is a special case of the gamma distribution when a = v/2 

and (3 = 2 , where v is the degrees of the freedom of the chi-squared distribution. 

So, the mean of the chi-squared distribution, using the property from the gamma 
distribution, is p = a(3 = (v/2) (2) = v, and the variance of the chi-squared distribution 
is cr' 2 = a(3 2 = (v/2)(2) 2 = 2v. 

6.84 Let A" be the length of time in seconds. Then Y = ln(AT) follows a normal distribution 
with p = 1.8 and a = 2. 

(a) P(X > 20) = P(Y > In 20) = P(Z > (In 20 - 1.8)/2) = P(Z > 0.60) = 0.2743. 

P(X > 60) = P(Y > In 60) = P(Z > (In 60 - 1.8)/2) = P(Z > 1.15) = 0.1251. 

(b) The mean of the underlying normal distribution is e L8 + 4 / 2 = 44.70 seconds. So, 
P(X < 44.70) = P(Z < (In 44.70 - 1.8)/2) = P(Z < 1) = 0.8413. 
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7.1 From y — 2a; — 1 we obtain x — (y + l)/2, and given x = 1, 2, and 3, then 

g(v) = f[(y + 1)/2] = 1 / 3 , for y = 1 , 3 , 5 . 


7.2 From y = x 2 , x = 0, 1, 2, 3, we obtain x = ^/y, 


g(y ) = f(Vv) 




for y = 0, 1, 4, 9. 


7.3 The inverse functions of y x = x\ + x 2 and y 2 = X\ — x 2 are xi = (yi + y 2 )/ 2 and 
X 2 = (yi — V2)/2- Therefore, 

{yi-y2)/2 / g 

V12 

where y, = 0, 1, 2, y 2 = -2, -1, 0, 1, 2, y 2 < y, and y x + y 2 = 0, 2, 4. 

7.4 Let W = X 2 . The inverse functions of y = X]X 2 and w = x 2 are x\ = y/w and x 2 = w, 
where y/w = 1, 2. Then 



y(yi,y 2 ) = yi+y2 
V 2 > 


V1-V2 

2 


,2 - yi 


(yi+V2)/2 


g(y,w ) 


(y/w)(w/ 18) = y/18, y = 1, 2, 3, 4, 6; w = 1, 2, 3, y/w = 1, 2. 


In tabular form the joint distribution g(y,w ) and marginal h(y) are given by 


g(y,w) 

1 

2 

y 

3 

4 

6 

l 

w 2 
3 

1/18 

2/18 

2/18 

3/18 

4/18 

6/18 

%) 

1/18 

2/9 

1/6 

2/9 

1/3 
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The alternate solutions are: 

P(Y =1)=/(1,1) = 1/18, ; 

P(Y = 2) = /( 1, 2) + / (2, 1) = 2/18 + 2/18 = 2/9, 

P(y = 3) = /(l, 3) = 3/18 = 1/6, 

P(Y' = 4) = / (2, 2) = 4/18 = 2/9, 

P(y = 6) = /( 2, 3) = 6/18 = 1/3. 

7.5 The inverse function of y = —2 In a: is given by x = e -2///2 from which we obtain 
| J| = | - e-' y/2 /2| = e~ y/2 /2. Now, 

g{y) = f{e y/2 )\J\ = e~ y/2 /2, y> 0, 
which is a chi-squared distribution with 2 degrees of freedom. 

7.6 The inverse function of y = 8x 3 is x = y 1 / 3 / 2, for 0 < y < 8 from which we obtain 
\J\ = y -2 / 3 / 6. Therefore, 

<?(?/) = f(y 1/3 / 2)|-/| = 2(2/ 1 /3/2)( 2/ - 2 /3/ 6) = V 1 / 3 , 0 < y < 8. 

6 


7.7 To find k we solve the equation k r 2 e 61,2 dv = 1. Let x = bv 2 , then dx = 26a? dv 
P - 1/2 


and dv = X 2 V dx. Then the equation becomes 

k f 


2 b 3 / 2 


x 


3/2 x e 1 dx = 1, 


or 


AT(3/2) 

26 3 / 2 


= 1 . 


Hence & = r(1 ^ . 

Now the inverse function of w = mv 2 / 2 is v = \J 2 w /m, for w > 0, from which we 
obtain \J\ = 1/V2 row. It follows that 


for ta > 0, which is a gamma distribution with a = 3/2 and (3 = m/26. 

7.8 (a) The inverse of y = a; 2 is a: = ^/y, for 0 < y < 1, from which we obtain \J\ = 1/2^/y. 

Therefore, 

^(j/) = fiVvM = 2 ( 1 - Vy)/ 2 Vy = y ~ 1/2 ~ l > 0 < y < l - 

(b) P(Y < 1) = /p 1 (y _1/2 - 1) dy = (2yV 2 - y)|J = 0.5324. 

7.9 (a) The inverse of y = x + 4 is a; = y — 4, for y > 4, from which we obtain | J| = 1. 

Therefore, 


y(y) = /(y-4)|J| = 32/y 3 , y > 4. 
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(b) P(Y > 8) = 32 / 8 °° y- 3 dy — — 16 y 


-2 1 oo 1 

4’ 


7.10 (a) Let W = X. The inverse functions of z = x + y and w = x are x = w and 
y = z — w, 0 < w < z, 0 < z < 1, from which we obtain 


J = 


1 dx 

dx 


1 0 

dw dz 


dy 

dw 

dy 

dz 


-1 1 


= 1 . 


Then g(w,z ) = f(w,z — w)\J\ = 24 w(z — w), for 0 < w < z and 0 < z < 1. The 
marginal distribution of Z is 

fi(z) — [ 24(z - w)w dw = 4z 3 , 0 < z < 1. 

Jo 

(b) P(l/2 < Z < 3/4) = 4 f*f*z 3 dz = 65/256. 

7.11 The amount of kerosene left at the end of the day is Z — Y — X. Let W = Y. The 
inverse functions of z = y — x and w = y are x = w — z and y = w, for 0 < z < w and 
0 < w < 1, from which we obtain 


J = 


dx 

dx 


1 -1 

chv 

dz 


dy 

dy 


1 0 

dw 

dz 



= 1 . 


Now, 


g(w, z) = g(w — z,w) — 2, 0 < z < w, 0 < w < 1, 

and the marginal distribution of Z is 

h(z) —2 f dw = 2(1 — z), 0 < z < 1. 

J Z 

7.12 Since X t and X 2 are independent, the joint probability distribution is 

f(xi,x 2 ) = f(xi)f(x 2 ) = e~ {xi+X2 \ x 1 > 0, x 2 > 0. 

The inverse functions of y\ — x\ + x 2 and y 2 = x\/{x\ + x 2 ) are x\ = y\y 2 and 
x 2 = 2/i(l — y 2 ), for y 1 > 0 and 0 < y 2 < 1, so that 


J = 


dxi/dyi dx\/dy 2 


V2 U\ 

dx 2 /dy\ dx 2 /dy 2 


1 - 1/2 -yi 


= -y i- 


Then, g(yi,y 2 ) = f(ym, yi(l -y 2 ))\J\ = y^e y \ for y x > 0 and 0 < y 2 < 1. Therefore, 


g(yi) = / Vie yi dy 2 = y x e y \ y l > 0, 


and 


g(y 2 ) = / Vie yi dyi = T(2) = 1, 0 < y 2 < 1 . 

Jo 

Since g(yi,y- 2 ) = g(yi)g(y 2 ), the random variables Y\ and Y 2 are independent. 
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7.13 Since / and R are independent, the joint probability distribution is 

f(i,r ) = 12ri(l — i), 0 < % < 1 , 0 < r < 1 . 

Let V = R. The inverse functions of w = i 2 r and v = r are i = \Jw/v and r — v. for 
w < v < 1 and 0 < w < 1 , from which we obtain 

di/dw di/dv _ 1 

dr/dw dr/dv 2^/vw' 

Then, 

g(w,v) = f(y/w/v,v)\J\ = 12vy/w/v(l - ^w/v)—^= = 6(1 - y/wjv), 

2 y/VW 


for w < v < 1 and 0 < w < 1, and the marginal distribution of W is 

h(w) — 6 f (1 — \fwjv ) dv = 6 (v — 2y/wv)\ v v J w = 6 + 6w — 12Vw, 0 < w < 1. 

J W 

7.14 The inverse functions of y = x 2 are given by X\ = y/y and X 2 = — yfy from which we 
obtain J\ = 1/2 yfy and J 2 = 1/2^/y. Therefore, 

= WW + «-V5)l ^1 = ^ ' 5R + ^ ' UT 1/2 ^’ 

for 0 < y < 1 . 

7.15 The inverse functions of y = x 2 are X\ = ^/y, X 2 = — y/y for 0 < y < 1 and x\ = yfy 
for 0 < y < 4. Now \R\ = \ J 2 \ = | J 3 I = 1 / 2 ^/y, from which we get 

3(4/) - f(Vv)\ il /( W)l 2 I - g ' 2 ^/y 9 ' 2y/y~ 9y/y' 

for 0 < y < 1 and 

g(y) = /(V5)W = 2(V?+ 11 ■ — = 2^±1, for 1 < y < 4. 
yvv; ./ wvu <ji 9 2v /y 9^/y ’ y 


7.16 Using the formula we obtain 

• r r , /3“+T(tt + r 

ft- = E(X)= x ■ dx = 


x 1 e 


/?“T(a) 


/5“T(a) Jo /3“ +r T(a + r) 


/3 r r(o; + r) 

rR 


since the second integrand is a gamma density with parameters^ + r and f3. 
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7.17 The moment-generating function of X is 


M x (t) = E{e tx ) = jY, etX = 


X=1 


e*(l - e kt ) 
k( 1 — e*) 


by summing the geometric series of fc terms. 

7.18 The moment-generating function of X is 


M x (t) = E(e ,x ) =p£eV-‘ = ^£(A)‘ 


pe 


1 — qe l 1 


X=1 X=1 

by summing an infinite geometric series. To find out the moments, we use 

(1 — ge*)pe* + pqe 2t I (1 — g)p + pg 1 


I - 1 — -^x(O) — 


(1 — ge*)^ 


t=o 




P 


and 


I- 1 2 — -^x(O) 


1 — q^ype 1 + 2pqe 2t (l — qe *) 


(1 — ge*) 4 


t=o 


2 — p 

p2 


So, cr" = /i 2 — p 2 = 


2 _ q 


7.19 The moment-generating function of a Poisson random variable is 


M x («) = £(e“) = £ 


00 e tx e~ fl p x _ ^ (^ e tu 




X! 


£ 

te=0 


_ g-MgMe* _ e At(e‘-l)_ 


X! 


So, 


p = M' x { 0) = p e^ iet - 1)+t 


t = 0 


/i, 


/4 = M x(°) = h eAt(et 1)+t (he* + 1) = p{p + 1), 

t=o 


and 


a 2 = p 2 — p 2 = p(p + 1 ) — p 2 = p. 

7.20 From Mx(t) = e 4 ( e * _1 ) we obtain p = 6, <r 2 = 4, and a = 2. Therefore, 

7 

P(/z - 2a < X < p + 2a) = P( 0 < X < 8 ) = p(x; 4) = 0.9489 - 0.0183 

£=1 


0.9306. 



90 


Chapter 7 Functions of Random Variables 


7.21 Using the moment-generating function of the chi-squared distribution, we obtain 

li = M' x (0) = v(l-2t)-”l 2 -\ =o = v, 
ft> = Aft(O) = v(v + 2) (1 - 2f) _ ” /2_2 | (=0 = v(v + 2). 

So, o' 2 = p' 2 — p 2 = v(v + 2) — v 2 = 2v. 


7.22 


M x (t ) = / e f(x) dx = 


' — OO 
ROO 


J — OO 
r»QO 


t 2 x 2 




1 + tx + — - H 1 — H f(x ) dx 

2 ! r ! 


j. 2 /-oo 


/(x) dx + t x/(x) dx + 


x 2 /(x) dx 


-j. r roo ^2 

+ • • • 4 r / X r f (x) dx + • • • = 1 + /it + /ii — + •••-(- h r —: + • • • 

r! / „ 2! r! 


7.23 The joint distribution of X and Y is /x,y(x, ?/) = e _a;_y for x > 0 and y > 0. The 
inverse functions of u — x + // and v — x/(x +y) are x = uv and y = u{ 1 — n) with 
n u 


J = 


-u 


= u for u > 0 and 0 < v < 1. So, the joint distribution of U and V is 


fu, v (u,v ) = ue~ uv ■ e~ u(1 ~ v) = ue ~ u , 
for m > 0 and 0 < v < 1. 

(a) fu(u) = f^ue~ u dv = ue~ u for u > 0, which is a gamma distribution with 
parameters 2 and 1. 

(b) fv(v) = J 0 °° ue~ u du — 1 for 0 < v < 1. This is a uniform (0,1) distribution. 
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Fundamental Sampling Distributions 
and Data Descriptions 

8.1 (a) Responses of all people in Richmond who have telephones. 

(b) Outcomes for a large or infinite number of tosses of a coin. 

(c) Length of life of such tennis shoes when worn on the professional tour. 

(d) All possible time intervals for this lawyer to drive from her home to her office. 

8.2 (a) Number of tickets issued by all state troopers in Montgomery County during the 

Memorial holiday weekend. 

(b) Number of tickets issued by all state troopers in South Carolina during the Memo- 
rial holiday weekend. 

8.3 (a) x = 2.4. 

(b) x — 2. 

(c) m = 3. 

8.4 (a) x = 8.6 minutes. 

(b) x = 9.5 minutes. 

(c) Mode are 5 and 10 minutes. 

8.5 (a) x = 3.2 seconds. 

(b) x = 3.1 seconds. 

8.6 (a) x = 35.7 grams. 

(b) x = 32.5 grams. 

(c) Mode=29 grams. 

8.7 (a) x = 53.75. 
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(b) Modes are 75 and 100. 

8.8 x = 22.2 days, x — 14 days and rn — 8 days, x is the best measure of the center of the 
data. The mean should not be used on account of the extreme value 95, and the mode 
is not desirable because the sample size is too small. 


1.9 (a) Range = 15 — 5 = 10. 


n Y2 x 2 —( Y2 Xi) 2 

(b) s 2 = 2=1 n(<n di) = (10) Q 0 U 9 T 8h = 10-933. Taking the square root, we have 


s = 3.307. 


(10) (9) 


8.10 (a) Range = 4.3 - 2.3 = 2.0. 

n n 

fb) .s-’ - - £i X *~ { £i Xi)2 _ (9)(96.1 1) — 28.8 2 _ 0 498 
b ~ n(n- 1) — (9) (8) “ 

n 

8.11 (a) s 2 = ^ £ ( Xi - x) 2 = ^[(2 - 2.4) 2 + (1 - 2.4) 2 + • • • + (2 - 2.4) 2 ] = 2.971. 

X=1 

n n 

«E*ME xi) 2 

(U\ „2 _ ti i=! _ (15)(128)-36 2 _ 9 q71 

V U ) * — n(n-l) — (15) (14) — 

8.12 (a) x = 11.69 milligrams. 


n n 


n V x 2 -(Y^ x) 2 

‘ £T ‘ _ (8)(1168.21) — 93.5 2 _ m 

V u / * — nCn-11 — (8)(7) — 1U.MU. 


.13 s 2 = 


n(n— 1) 

n 

n E x?-(E x d 2 


2 _ 4=i ' hti " _ (2)(148.55)-53.3 2 

n(n-l) (20) (19) 


= 0.342 and hence s = 0.585. 


8.14 (a) Replace Ah in S 2 by X, + c for % — 1,2, ... ,n. Then X becomes X + c and 


n 

s 2 = — j- ElW + c ) - (V + c)] 2 

i = 1 


1 

n — 1 


E< A '. - A > 2 - 
1=1 


(b) Replace X t by cX, in S 2 for i = 1,2 , ... , n. Then X becomes cX and 


S 2 



^(cA, - cA ) 2 

i= 1 



E( A - - A > 2 - 

i= 1 


8.15 s 2 = 


»E^-(E = 

i=l i = 1 

n(n— 1) 


(6)(207)-33 2 _ c 1 
(6)(5) - 0 - 1 - 


(a) Multiplying each observation by 3 gives s 2 = (9) (5.1) = 45.9. 

(b) Adding 5 to each observation does not change the variance. Hence s 2 


5.1. 


8.16 Denote by D the difference in scores. 
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(a) D = 25.15. 

(b) D = 31.00. 


8.17 z i = —1.9, Z 2 = —0.4. Hence, 

P(px ~ l-9ox < X < Ex ~ 0.4<7_y) = P(-1.9 < Z < -0.4) = 0.3446 - 0.0287 = 0.3159. 

8.18 n = 54, px = 4, a\ = a 2 /n = (8/3)/ 54 = 4/81 with ax — 2/9. So, 

z x = (4.15- 4)/(2/9) = 0.68, and z 2 = (4.35 - 4)/(2/9) = 1.58, 

and 

P(4.15 < X < 4.35) = P(0.68 < Z < 1.58) = 0.9429 - 0.7517 = 0.1912. 

8.19 (a) For n = 64, ax = 5.6/8 = 0.7, whereas for n = 196, — 5.6/14 = 0.4. 

Therefore, the variance of the sample mean is reduced from 0.49 to 0.16 when the 
sample size is increased from 64 to 196. 

(b) For n = 784, ax = 5.6/28 = 0.2, whereas for n = 49, ax = 5.6/7 = 0.8. 
Therefore, the variance of the sample mean is increased from 0.04 to 0.64 when 
the sample size is decreased from 784 to 49. 

8.20 n = 36, ax = 2. Hence a = \fnax = (6) (2) = 12. If ax = 1.2, then 1.2 = 12/ydr and 
n = 100. 

8.21 fix = E = 240, ax = 15//40 = 2.372. Therefore, fi x ± 2a x = 240 ± (2) (2.372) or from 
235.257 to 244.743, which indicates that a value of x — 236 milliliters is reasonable 
and hence the machine needs not be adjusted. 

8.22 (a) fix = E = 174.5, = ° / V™ = 6-9/5 = 1.38. 

(b) *i = (172.45 - 174.5)/1.38 = -1.49, = (175.85 - 174.5)/1.38 = 0.98. So, 

P(172.45 < X < 175.85) = P(-1.49 < Z < 0.98) = 0.8365 - 0.0681 = 0.7684. 

Therefore, the number of sample means between 172.5 and 175.8 inclusive is 
(200) (0.7684) = 154. 

(c) z = (171.95 - 174.5)/1.38 = -1.85. So, 

P(X < 171.95) = P(Z < -1.85) = 0.0322. 

Therefore, about (200) (0.0322) = 6 sample means fall below 172.0 centimeters. 

(a) fi = = (4)(0.2) + (5)(0.4) + (6)(0.3) + (7)(0.1) = 5.3, and 

a 2 _ YXx - fi) 2 f(x) = (4 - 5.3) 2 (0.2) + (5 - 5.3) 2 (0.4) + (6 - 5.3) 2 (0.3) - (7 - 
5.3) 2 (0.1) = 0.81. 


8.23 



94 


Chapter 8 Fundamental Sampling Distributions and Data Descriptions 


(b) With n = 36, px — p — 5.3 and a + = cr 2 /n = 0.81/36 = 0.0225. 

(c) n = 36, px = 5.3, = 0.9/6 = 0.15, and 2 = (5.5 - 5.3)/0.15 = 1.33. So, 

P(X < 5.5) = P(Z < 1.33) = 0.9082. 

8.24 n = 36, p x = 40, a* = 2/6 = 1/3 and 2 = (40.5 - 40)/(l/3) = 1.5. So, 

P Xi > 1458^ = P(X > 40.5) = P(Z > 1.5) = 1 - 0.9332 = 0.0668. 

8.25 (a) P(6.4 < X < 7.2) = P(-1.8 < Z < 0.6) = 0.6898. 

(b) 2 = 1.04, x = z(a /y/n) +p = (1.04)(l/3) + 7 = 7.35. 

8.26 n = 64, p^ = 3.2, cry = cr/i/n = 1.6/8 = 0.2. 

(a) 2 = (2.7 - 3.2)/0.2 = -2.5, P(X < 2.7) = P(Z < -2.5) = 0.0062. 

(b) 2 = (3.5 - 3.2)/0.2 = 1.5, P(X > 3.5) = P(Z > 1.5) = 1 - 0.9332 = 0.0668. 

(c) 21 = (3.2 - 3.2)/0.2 = 0, z 2 = (3.4 - 3.2)/0.2 = 1.0, 

P( 3.2 < X < 3.4) = P(0 < Z < 1.0) = 0.9413 - 0.5000 = 0.3413. 

8.27 n = 50, x = 0.23 and a = 0.1. Now, 2 = (0.23 - 0.2)/(0.1/V50) = 2.12; so 

P(X > 0.23) = P(Z > 2.12) = 0.0170. 

Hence the probability of having such observations, given the mean p = 0.20, is small. 
Therefore, the mean amount to be 0.20 is not likely to be true. 

8.28 p\ — P'2 = 80-75 = 5, (T^-Xt = y^/ 25 + 9/36 = 1.118, zi = (3.35— 5)/l . 1 18 = -1.48 
and z 2 = (5.85 - 5)/1.118 = 0.76. So, 

P(3.35 < X-i - X 2 < 5.85) = P(-1.48 < Z < 0.76) = 0.7764 - 0.0694 = 0.7070. 

8.29 p Xl - x 2 = 72 - 28 = 44_, a Xl -x 2 = y/ 100 / 64 + 25/1.00 = 1.346 and z = (44.2 - 
44)/ 1.346 = 0.15. So, P(Xi - X 2 < 44.2) = P(Z < 0.15) = 0.5596. 

8.30 pi - p 2 = 0, (7 x 1 —x 2 = 50^1/32 + 1/50 = 11.319. 

(a) z 1 = — 20/11.319 = -1.77, = 20/11.319 = 1.77, so 

P( \Xi - X 2 \ > 20) = 2 P(Z < -1.77) = (2)(0.0384) = 0.0768. 

(b) zi = 5/11.319 = 0.44 and z 2 = 10/11319 = 0.88. So, 

P(-10 < Ad - X 2 < -5) + P(5 < X, - X 2 < 10) = 2P(5 < - X 2 < 10) = 

2P(0.44 < Z < 0.88) = 2(0.8106 - 0.6700) = 0.2812. 


8.31 The normal quantile-quantile plot is shown as 
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Normal Q-Q Plot 
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Theoretical Quantiles 


8.32 (a) If the two population mean drying times are truly equal, the probability that the 

difference of the two sample means is 1.0 is 0.0013, which is very small. This means 
that the assumption of the equality of the population means are not reasonable. 

(b) If the experiment was run 10,000 times, there would be (10000) (0.0013) = 13 
experiments where X A — X B would be at least 1.0. 

8.33 (a) ni = n 2 = 36 and * = 0.2/^1/36 + 1/36 = 0.85. So, 

P(X B -X A > 0.2) = P(Z > 0.85) = 0.1977. 

(b) Since the probability in (a) is not negligible, the conjecture is not true. 

8.34 The normal quantile-quantile plot is shown as 

Normal Q-Q Plot 



Theoretical Quantiles 
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8.35 (a) When the population equals the limit, the probability of a sample mean exceeding 

the limit would be 1/2 due the symmetry of the approximated normal distribution. 

(b) P(X > 7960 | p = 7950) = P(Z > (7960 - 7950)7(100/^25)) = P(Z > 0.5) = 
0.3085. No, this is not very strong evidence that the population mean of the 
process exceeds the government limit. 

8.36 (a) (?x a -x b = + § = 1-29 and 2 = = 3.10. So, 

P(X A - X B > 4 | ha = Fb) = P{Z > 3.10) = 0.0010. 

Such a small probability means that the difference of 4 is not likely if the two 
population means are equal. 

(b) Yes, the data strongly support alloy A. 

8.37 Since the probability that X < 775 is 0.0062, given that p = 800 is true, it suggests 
that this event is very rare and it is very likely that the claim of p — 800 is not true. 
On the other hand, if p is truly, say, 760, the probability 

P(X < 775 i p = 760) = P(Z < (775 - 760)/(40/Vl6)) = P(Z < 1.5) = 0.9332, 

which is very high. 

8.38 Define Wi = In X t for i = 1,2,.... Using the central limit theorem, Z = ( W — 
Pwi) / \°Wxl V™) ~ n(z;0, 1). Hence W follows, approximately, a normal distribution 
when n is large. Since 

n / n \ t 

W=-Y, H*i) = - TT X * = ~ ln ( y ')’ 

n n \ - LX / n 

i— 1 \ 7=1 / 

then it is easily seen that Y follows, approximately, a lognormal distribution. 

8.39 (a) 27.488. 

(b) 18.475. 

(c) 36.415. 

8.40 (a) 16.750. 

(b) 30.144. 

(c) 26.217. 

8.41 (a) xl = Xlna = 0.297. 

(b) xi; = X0.025 = 32.852. 

(c) X 0.05 = 37.652. Therefore, a = 0.05 — 0.045 = 0.005. Hence, Xa — X 0.005 = 46.928. 

8.42 (a) = x 2 m = 38.932. 



Solutions for Exercises in Chapter 8 


97 


(b) X 2 = X 0.05 = 12.592. 

(c) Xo.oi = 23.209 and Xo .025 = 20-483 with a = 0.01 + 0.015 = 0.025. 

8.43 (a) P{S 2 > 9.1) = P > (24) f' 1} ) = P(x 2 > 36.4) = 0.05. 

(b) / > (3.462 < S 2 < 10.745) = P (MM < < MiiMS) 

= P(13.848 < x 2 < 42.980) = 0.95 - 0.01 = 0.94. 

8.44 x 2 — ^ 19 g 20 ^ = 47.5 while Xo.oi = 36.191. Conclusion values are not valid. 

8.45 Since ( "~ 1 L , ) ' S is a chi-square statistic, it follows that 

9 (n — l) 2 9 . , 

<J (n-l)S 2 / ( r 2 = ^4 a s 2 = 2 ( n ~ !)• 

Hence, <r| 2 = which decreases as n increases. 

8.46 (a) 2.145. 

(b) -1.372. 

(c) -3.499. 

8.47 (a) P(T < 2.365) = 1 - 0.025 = 0.975. 

(b) P(T > 1.318) = 0.10. 

(c) P(T < 2.179) = 1 - 0.025 = 0.975, P(T < -1.356) = P(T > 1.356) = 0.10. 
Therefore, P(-1.356 <T < 2.179) = 0.975 - 0.010 = 0.875. 

(d) P(T > -2.567) = 1 - P(T > 2.567) = 1 - 0.01 = 0.99. 

8.48 (a) Since fo.oi leaves an area of 0.01 to the right, and — to.oos an area of 0.005 to the 

left, we find the total area to be 1 — 0.01 — 0.005 = 0.985 between — fo .005 and 
^o.oi- Hence, P(— to.oos < T < fo.oi) — 0.985. 

(b) Since — fo .025 leaves an area of 0.025 to the left, the desired area is 1—0.025 = 0.975. 
That is, P(T > —fo. 025 ) = 0.975. 

8.49 (a) From Table A. 4 we note that 2.069 corresponds to fo .025 when v = 23. Therefore, 

— ^ fo .025 = —2.069 which means that the total area under the curve to the left of 
t = k is 0.025 + 0.965 = 0.990. Hence, k = fo.oi = 2.500. 

(b) From Table A. 4 we note that 2.807 corresponds to to .005 when v = 23. Therefore 
the total area under the curve to the right of f = k is 0.095 + 0.005 = 0.10. Hence, 
k = fo .10 = 1-319. 

(c) f 0 .o 5 = 1-714 for 23 degrees of freedom. 
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8.50 From Table A. 4 we find fo .025 = 2.131 for v = 15 degrees of freedom. Since the value 

27.5 - 30 

t = r-7T = -2.00 

5/4 

falls between —2.131 and 2.131, the claim is valid. 

8.51 t = (24 — 20) / (4.1/3) = 2.927, f 0 .oi = 2.896 with 8 degrees of freedom. Conclusion: 
no, /i > 20. 

8.52 x = 0.475, s 2 = 0.0336 and t = (0.475 - 0.5)/0.0648 = -0.39. Hence 

P(X < 0.475) = P(T < -0.39) « 0.35. 


So, the result is inconclusive. 

8.53 (a) 2.71. 

(b) 3.51. 

(c) 2.92. 

(d) 1/2.11 = 0.47. 

(e) 1/2.90 = 0.34. 

8.54 s 2 = 10.441 and s\ = 1.846 which gives / = 5.66. Since, from Table A. 6, f 0 . 05 ( 9 , 7) = 
3.68 and /o.oi(9, 7) = 6.72, the probability of P(F > 5.66) should be between 0.01 and 
0.05, which is quite small. Hence the variances may not be equal. Furthermore, if a 
computer software can be used, the exact probability of F > 5.66 can be found 0.0162, 
or if two sides are considered, P(F < 1/5.66) + P(F > 5.66) = 0.026. 

8.55 s 2 = 15750 and s\ = 10920 which gives / = 1.44. Since, from Table A. 6, /o.o5(4, 5) = 
5.19, the probability of F > 1.44 is much bigger than 0.05, which means that the two 
variances may be considered equal. The actual probability of F > 1.44 is 0.3436 and 
P(F < 1/1.44) + P(F > 1.44) = 0.7158. 

8.56 The box-and- whisker plot is shown below. 


Box-and-Whisker Plot 
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The sample mean = 12.32 and the sample standard deviation = 6.08. 

8.57 The moment-generating function for the gamma distribution is given by 

1 f°° 

M x (t ) = E(e tx ) = - — — / e tx x a ~ l e~ x ^ dx 
(3 a T[a) J 0 

1 1 

~ P°(l/ (3- t)° (1/(3- t)-°T(a) 

1 f°° x°‘- 1 e~ x l^ 1 l^- t ^ 1 1 

~~ J 0 [(1 /(3-t)~^T(a) dX ~ (1 -/?*)“’ 

for t < 1/(3, since the last integral is one due to the integrand being a gamma density 
function. Therefore, the moment-generating function of an exponential distribution, 
by substituting a to 1, is given by M x (t ) = (1 — Ot) 1 . Hence, the moment-generating 
function of Y can be expressed as 

n 

M Y (t) = M Xl {t)M X2 {t) ■ ■ ■ M Xn (t) = J](1 - et)- 1 = (1 - et)~ n , 

i= 1 

which is seen to be the moment-generating function of a gamma distribution with 
a — n and (3 = 6 . 


x 


•-ie-KlH) 


dx 


8.58 The variance of the carbon monoxide contents is the same as the variance of the coded 

measurements. That is, s 2 = _ 7,039, which results in s = 2.653. 

8.59 P (fj < 4.89) = P < 4.89) = P(F < 4.89) = 0.99, where F has 7 and 11 
degrees of freedom. 


8.60 s 2 = 114,700,000. 

8.61 Let X\ and X 2 be Poisson variables with parameters Ai = 6 and A 2 = 6 representing the 
number of hurricanes during the first and second years, respectively. Then Y = X\ ~hX 2 
has a Poisson distribution with parameter A = Ai + A 2 = 12. 

(a) P(Y = 15) = e ^ 5 ] 215 = 0.0724. 

(b) P(Y <9 ) = E — r ~ = °- 2424 - 

y = 0 v ' 


8.62 Dividing each observation by 1000 and then subtracting 55 yields the following data: 
—7, —2, —10, 6, 4, 1, 8, —6, —2, and —1. The variance of this coded data is 

(10)(311) (-9)^33 656 

(10) (9) 

Hence, with c = 1000, we have 

s 2 = (1000) 2 (33.656) = 33.656 x 10 6 , 


and then s = 5801 kilometers. 
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8.63 The box-and- whisker plot is shown below. 

Box-and_Whisker Plot 


LO 



The sample mean is 2.7967 and the sample standard deviation is 2.2273. 

8.64 P (§| > 1.26) = P (§f^| > (15 ) [q 26) ) = P(F > 1.89) « 0.05, where F has 24 and 30 
degrees of freedom. 

8.65 No outliers. 


8.66 The value 32 is a possible outlier. 


8.67 /j = 5,000 psi, o = 400 psi, and n = 36. 


(a) Using approximate normal distribution (by CLT), 


P(4800 < X < 5200) 


/ 4800 - 5000 5200 - 5000\ 

V 400/^ < < 400/ v / 36 ) 

P(—3 < Z < 3) = 0.9974. 


(b) To find a z such that P{— z < Z < z) = 0.99, we have P(Z < z) = 0.995, which 
results in z = 2.575. Hence, by solving 2.575 = 51 ^^° we have n > 107. Note 
that the value n can be affected by the z values picked (2.57 or 2.58). 


8.68 x = 54,100 and s = 5801.34. Hence 

54100 - 53000 

t = = 0.60. 

5801.34/^10 


So, P{ X > 54, 100) = P(T > 0.60) is a value between 0.20 and 0.30, which is not a 
rare event. 


8.69 tia = ns = 20, xa = 20.50, xb = 24.50, and cr a = &b = 5. 

(a) P(X A - X B > 4.0 | p A = p B ) = P{Z > (24.5 - 20.5)/^5 2 /20 + 5 2 /20) 
= P(Z > 4.5/(5/v / T0)) = P(Z > 2.85) = 0.0022. 
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(b) It is extremely unlikely that p A = p b ■ 

8.70 (a) n A = 30, x A = 64.5% and a a = 5%. Hence, 

P(X A < 64.5 | p A = 65) = P(Z < (64.5 - 65)/(5/v / 30)) = P(Z < -0.55) 

= 0.2912. 

There is no evidence that the p A is less than 65%. 

(b) ns = 30, xb = 70% and a B = 5%. It turns out <Jx B -x A — \J fjj + fjj = 1-29%. 
Hence, 

^ = P(Z > 4.26) « 0. 

It does strongly support that pb is greater than p A - 

(c) i) Since uy s = = 0.9129, X B ~ n(x; 65%, 0.9129%). 

ii) X A — Xb ~ n(x; 0, 1.29%). 

775ft °- 1 >- 

8.71 P(X B >7I)) = P(Z> gyf) = P(Z > 5.48) « 0. 

8.72 It is known, from Table A. 3, that P(— 1.96 < Z < 1.96) = 0.95. Given p — 20 and 

a = \/9 = 3, we equate 1.96 = 20 y^° to obtain n = j — 3457.44 rs 3458. 

8.73 It is known that P(— 2.575 < Z < 2.575) = 0.99. Hence, by equating 2.575 = jtj=, we 
obtain n = (|^) 2 = 2652.25 « 2653. 

8.74 p = 9 and a — 1. Then 

P( 9 - 1.5 < X < 9 + 1.5) = P(7.5 < X < 10.5) = P(-1.5 < Z < 1.5) 

= 0.9322 - 0.0668 = 0.8654. 

Thus the proportion of defective is 1 — 0.8654 = 0.1346. To meet the specifications 
99% of the time, we need to equate 2.575 = since P(— 2.575 < Z < 2.575) = 0.99. 
Therefore, a = = 0.5825. 

8.75 With the 39 degrees of freedom, 

P(S 2 < 0.188 | u 2 = 1.0) = P(x 2 < (39)(0.188)) = P(y 2 < 7.332) w 0, 

which means that it is impossible to observe s 2 = 0.188 with n = 40 for a' 2 = 1. 

Note that Table A. 5 does not provide any values for the degrees of freedom to be larger 
than 30. However, one can deduce the conclusion based on the values in the last line 
of the table. Also, computer software gives the value of 0. 


X A > 5.5 | p A = Pb) = P ( Z > 
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One- and Two-Sample Estimation 
Problems 


9.1 From Example 9.1 on page 271, we know that E(S 2 ) = cr 2 . Therefore, 


E(S' 2 ) = E 


n — 1 


S 2 


n 




n 


n 


9.2 


(a) 

(b) 


E(X) = np ; E(P) = E(X/n ) = E(X)/n 

— E(X)+y/n/2 _ np+y/n/2 / 

' ' n+y/n n+y/n ' 


np/n = p. 


9.3 


lim n P + p! 2 

n— >oo n +V^ 


lim 

n— >oo 


P+l/2y/n 
1 + 1 / 


= p. 


9.4 n = 30, a; = 780, and a = 40. Also, z 0 . 02 = 2.054. So, a 96% confidence interval for the 
population mean can be calculated as 


780 - (2.054) (40/V30) < p < 780 + (2.054) (40/v / 30), 


or 765 < p < 795. 

9.5 n = 75, x = 0.310, a = 0.0015, and 20.025 = 1.96. A 95% confidence interval for the 
population mean is 

0.310 - (1.96)(0.0015/v / 75) < p < 0.310 + (1.96) (0.0015/ V75), 


or 0.3097 < p < 0.3103. 

9.6 n = 50, x = 174.5, a = 6.9, and 2 0 .oi = 2.33. 

(a) A 98% confidence interval for the population mean is 

174.5 - (2.33)(6.9/v / 50) < p < 174.5 + (2.33) (6.9/V50), or 172.23 < p < 176.77. 

(b) e < (2.33) (6.9)/ ^50 = 2.27. 
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9.7 n = 100, x = 23, 500, a = 3900, and ^ 0.005 = 2.575. 

(a) A 99% confidence interval for the population mean is 

23, 500 - (2.575)(3900/10) < p < 23, 500 + (2.575)(3900/10), or 
22,496 < p < 24,504. 

(b) e < (2.575)(3900/10) = 1004. 

9.8 n = [(2.05)(40)/10] 2 = 68 when rounded up. 

9.9 n = [(1.96) (0.0015) / 0.0005] 2 = 35 when rounded up. 

9.10 n = [(1.96)(40)/15] 2 = 28 when rounded up. 

9.11 n = [(2.575) (5. 8)/2] 2 = 56 when rounded up. 

9.12 n — 20, x — 11.3, s = 2.45, and fo .025 = 2.093 with 19 degrees of freedom. A 95% 
confidence interval for the population mean is 

11.3 - (2.093)(2.45/v^0) < /J < 11.3 + (2.093)(2.45/V20), 
or 10.15 < p < 12.45. 

9.13 n — 9, x — 1.0056, s = 0.0245, and f 0 .oo 5 = 3.355 with 8 degrees of freedom. A 99% 
confidence interval for the population mean is 

1.0056 - (3. 355)(0. 0245/3) < p< 1.0056 + (3.355)(0.0245/3), 
or 0.978 < p < 1.033. 

9.14 n — 10, x — 230, s = 15, and %.oo 5 = 3.25 with 9 degrees of freedom. A 99% confidence 
interval for the population mean is 

230 - (3.25) (15/\/l0) < p < 230 + (3.25) (15/^10), 
or 214.58 < /j < 245.42. 

9.15 n = 12, x — 48.50, s = 1.5, and t 0 .05 = 1-796 with 11 degrees of freedom. A 90% 
confidence interval for the population mean is 

48.50 - (1.796)(1.5/^12) < p < 48.50 + (1.796)(1.5/v / 12), 
or 47.722 < p < 49.278. 

9.16 n = 12, x = 79.3, s = 7.8, and t 0 .025 = 2.201 with 11 degrees of freedom. A 95% 
confidence interval for the population mean is 

79.3 - (2.201) (7.8/-\/l2) < p < 79.3 + (2.201)(7.8/^12), 
or 74.34 < p < 84.26. 
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9.17 n = 25,x = 325.05, s = 0.5,7 = 5%, and 1 - a = 90%, with k = 2.208. So, 325.05 ± 
(2.208)(0.5) yields (323.946,326.151). Thus, we are 95% confident that this tolerance 
interval will contain 90% of the aspirin contents for this brand of buffered aspirin. 

9.18 n — 15, x = 3.7867,s = 0.9709,7 = 1%, and 1 — a = 95%, with k = 3.507. So, by 
calculating 3.7867 ± (3.507) (0.9709) we obtain (0.382, 7.192) which is a 99% tolerance 
interval that will contain 95% of the drying times. 

9.19 n = 100, x = 23,500,s = 3,900,1 — a = 0.99, and 7 = 0.01, with k = 3.096. The 
tolerance interval is 23,500 ± (3.096) (3,900) which yields 11,425 < fi < 35,574. 

9.20 n = 12, x = 48.50, 5 = 1.5, 1 — a = 0.90, and 7 = 0.05, with k = 2.655. The tolerance 
interval is 48.50 ± (2.655)(1.5) which yields (44.52,52.48). 

9.21 By definition, MSE = E(Q — 6 ) 2 which can be expressed as 

MSE = E[Q - E(Q) + E(Q) - 9 } 2 

= E[Q - U(0)] 2 + E[E(Q) - 9 } 2 + 2 E[G - E(Q)]E[E(Q) - 9}. 

The third term on the right hand side is zero since E[Q — 77(©)] = E[Q] — E(Q) = 0. 
Hence the claim is valid. 

9.22 (a) The bias is E(S' 2 ) - a 2 = - u 2 = 

(b) lim Bias = lim — = 0. 

n— >00 n— >00 n 

9.23 Using Theorem 8.4, we know that X 2 = < ' IL ^) S follows a chi-squared distribution with 

n — 1 degrees of freedom, whose variance is 2 (n — 1). So, V ar{S 2 ) = V ar X 2 j = 

^a 4 , and Var(S' 2 ) = Var (^S 2 ) = ) 2 Var(S 2 ) = 2(n J )rT> . Therefore, the 

variance of S' 2 is smaller. 

9.24 Using Exercises 9.21 and 9.23, 

MSE(S 2 ) _ Var(S 2 ) + [Bias(S 2 )] 2 _ 2 a 4 /(n - 1) 

MSE{S /2 ) ““ Var(S' 2 ) + [Bias(^ /2 )] 2 “ 2(n - l)u 4 /™ 2 + a 4 jn 2 

3n-l 

+ 2 n 2 -3n+l’ 

which is always larger than 1 when n is larger than 1. Hence the MSE of S' 2 is usually 
smaller. 

9.25 n = 20, a; = 11.3,5 = 2.45, and to.025 = 2.093 with 19 degrees of freedom. A 95% 
prediction interval for a future observation is 

11.3 ± (2.093) (2.45) \/I + 1/20 = 11.3 ± 5.25, 


which yields (6.05, 16.55). 
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9.26 n = 12, x = 79.3, s = 7.8, and to . 025 = 2.201 with 11 degrees of freedom. A 95% 
prediction interval for a future observation is 

79.3 ± (2.201) (7.8) \/I + 1/12 = 79.3 ± 17.87, 
which yields (61.43,97.17). 

9.27 n — lh,x — 3.7867, s = 0.9709, and fo .025 = 2.145 with 14 degrees of freedom. A 95% 
prediction interval for a new observation is 

3.7867 ± (2. 145) (0.9709) y/l + 1/15 = 3.7867 ± 2.1509, 
which yields (1.6358,5.9376). 

9.28 n = 9, x = 1.0056, s = 0.0245,1 - a = 0.95, and 7 = 0.05, with k = 3.532. The 
tolerance interval is 1.0056 ± (3.532)(0.0245) which yields (0.919, 1.092). 

9.29 n = 15,1; = 3.84, and s = 3.07. To calculate an upper 95% prediction limit, we 
obtain f 0 .05 = 1-761 with 14 degrees of freedom. So, the upper limit is 3.84 + 
(1.761)(3.07)y/l + 1/15 = 3.84 + 5.58 = 9.42. This means that a new observation 
will have a chance of 95% to fall into the interval (— 00 , 9.42). To obtain an up- 
per 95% tolerance limit, using 1 — a = 0.95 and 7 = 0.05, with k = 2.566, we get 
3.84 + (2.566)(3.07) = 11.72. Hence, we are 95% confident that (— 00 , 11.72) will 
contain 95% of the orthophosphorous measurements in the river. 

9.30 n = 50,5; = 78.3, and s = 5.6. Since tom = 1-677 with 49 degrees of freedom, 
the bound of a lower 95% prediction interval for a single new observation is 78.3 — 
(1.677)(5.6)a/1 + 1/50 = 68.91. So, the interval is (68.91, 00 ). On the other hand, 
with 1 — a = 95% and 7 = 0.01, the k value for a one-sided tolerance limit is 2.269 
and the bound is 78.3 — (2.269) (5.6) = 65.59. So, the tolerance interval is (65.59, 00 ). 

9.31 Since the manufacturer would be more interested in the mean tensile strength for future 
products, it is conceivable that prediction interval and tolerance interval may be more 
interesting than just a confidence interval. 

9.32 This time 1 — a = 0.99 and 7 = 0.05 with k = 3.126. So, the tolerance limit is 
78.3 — (3. 126) (5.6) = 60.79. Since 62 exceeds the lower bound of the interval, yes, this 
is a cause of concern. 

9.33 In Exercise 9.27, a 95% prediction interval for a new observation is calculated as 
(1.6358,5.9377). Since 6.9 is in the outside range of the prediction interval, this new 
observation is likely to be an outlier. 

9.34 n — 12, x = 48.50, s = 1.5,1 — a = 0.95, and 7 = 0.05, with k = 2.815. The lower 
bound of the one-sided tolerance interval is 48.50 — (2.815) (1.5) = 44.275. Their claim 
is not necessarily correct. 
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9.35 n i = 25,772 = 36, x\ = 80, x 2 = 75, <xi = 5, <72 = 3, and £ 0.03 = 1-88. So, a 94% 
confidence interval for pi — p 2 is 

(80 - 75) - (1.88)^25/25 + 9/36 < p 1 - p 2 < (80 - 75) + (1.88)^25/25 + 9/36, 
which yields 2.9 < pi — p 2 < 7.1. 

9.36 riA = 50, ns = 50, xa = 78.3, xb = 87.2, a a = 5.6, and erg = 6.3. It is known that 

£ 0.025 = 1-96. So, a 95% confidence interval for the difference of the population means 
is 

(87.2 - 78.3) ± 1.96a/5.6 2 /50 + 6.3 2 /50 = 8.9 ± 2.34, 

or 6.56 < pa ~ Eb < 11.24. 

9.37 n i = 100, n 2 = 200, x\ = 12.2, x 2 = 9.1, si = 1.1, and s 2 = 0.9. It is known that 
Zqo\ = 2.327. So 

(12.2 - 9.1) ± 2.327^/1. 1 2 /100 + 0.9 2 /200 = 3.1 ± 0.30, 

or 2.80 < pi — p 2 < 3.40. The treatment appears to reduce the mean amount of metal 
removed. 

9.38 rii = 12,7i2 = 10, x\ = 85, x 2 = 81, si = 4, s 2 = 5, and s p = 4.478 with tom = 1-725 
with 20 degrees of freedom. So 

(85 - 81) ± (1.725) (4.478) VV 12 + 1/10 = 4 ± 3.31, 
which yields 0.69 < pi — p 2 < 7.31. 

9.39 rii = 12,712 = 18, x\ = 84, x 2 = 77, s\ = 4, s 2 = 6, and s p = 5.305 with fo.oos = 2.763 
with 28 degrees of freedom. So, 

(84 - 77) ± (2.763)(5.305)Vl/12 + 1/18 = 7 ± 5.46, 
which yields 1.54 < pi — p 2 < 12.46. 

9.40 n i = 10, 77,2 = 10, X\ = 0.399, x 2 = 0.565, Si = 0.07279, s 2 = 0.18674, and s p = 0.14172 
with to . 025 = 2.101 with 18 degrees of freedom. So, 

(0.565 - 0.399) ± (2.101)(0.14172) ^/l/lO + 1/10 = 0.166 ± 0.133, 
which yields 0.033 < pi — p 2 < 0.299. 

9.41 77i = 14, n 2 = 16, Xi = 17, x 2 = 19, s 2 = 1.5, s 2 = 1.8, and s p = 1.289 with t 0 0 05 = 
2.763 with 28 degrees of freedom. So, 

(19 - 17) ± (2.763)(1.289)Vl/16 + 1/14 = 2 ± 1.30, 
which yields 0.70 < pi — p 2 < 3.30. 
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9.42 n i = 12, n -2 = 10, x\ = 16, X 2 = 11, •+ = 1.0, S 2 = 0.8, and s p = 0.915 with t 0 .os = 1-725 
with 20 degrees of freedom. So, 

(16 - 11) ± (1.725)(0.915)Vl/12 + 1/10 = 5 ± 0.68, 

which yields 4.3 < pi — /i 2 < 5.7. 

9.43 71a = n R = 12, xa = 36, 300, xb = 38, 100, = 5, 000, sb = 6, 100, and 

5000 2 /12 + 6100 2 /12 _ 

V ~ ( 5000 2 / 12) 2 . ( 6100 2 / 12) 2 _ ’ 

11 11 

with to. 025 = 2.080 with 21 degrees of freedom. So, 

/5000 2 6100 2 

(36, 300 - 38, 100) ± (2.080)7^- + = -1, 800 ± 4, 736, 

which yields —6, 536 < pa — Pb < 2, 936. 

9.44 n — 8, d — —1112.5, Sd = 1454, with to.oos = 3.499 with 7 degrees of freedom. So, 


-1112.5 ± (3.499) — y=- = -1112.5 ± 1798.7, 

V8 

which yields —2911.2 < p D < 686.2. 

9.45 n — 9,d — 2.778, Sd = 4.5765, with to .025 = 2.306 with 8 degrees of freedom. So, 

4 5765 

2.778 ± (2.306) ‘ _ = 2.778 ± 3.518, 

V9 

which yields —0.74 < p D < 6.30. 

9.46 nj = 5, nu = 7 ,xi = 98.4, xn = 110.7, si = 8.375, and sji = 32.185, with 

_ (8.73575 + 32.18577)2 _ 

V ~ ( 8 . 735 2 / 5) 2 . ( 32 . 185 2 / 7) 2 _ ‘ 

4 ' 6 

So, to . 05 = 1.895 with 7 degrees of freedom. 

(110.7 - 98.4) ± 1.89578.7352/5 + 32. 185 2 /7 = 12.3 ± 24.2, 
which yields —11.9 < Hn — pi < 36.5. 

9.47 n = 10, d — 14.89%, and Sd = 30.4868, with to .025 = 2.262 with 9 degrees of freedom. 
So, 

, . 30.4868 

14.89 ± 2.262 — = 14.89 ± 21.81, 

TTo 


which yields —6.92 < p D < 36.70. 
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9.48 riA = tie = 20, xa = 32.91, xb = 30.47, = 1.57, sb = 1.74, and S p = 1.657. 

(a) to . 025 ~ 2.042 with 38 degrees of freedom. So, 

(32.91 - 30.47) ± (2. 042)(1. 657)^/1/20 + 1/20 = 2.44 ± 1.07, 
which yields 1.37 < pa ~ Pb < 3.51. 

(b) Since it is apparent that type A battery has longer life, it should be adopted. 

9.49 riA = ns = 15, xa = 3.82, x B = 4.94,5^ = 0.7794, s B = 0.7538, and s p = 0.7667 with 
to . 025 = 2.048 with 28 degrees of freedom. So, 

(4.94 - 3.82) ± (2.048) (0.7667)^/1/15 + 1/15 = 1.12 ± 0.57, 

which yields 0.55 < p B — Pa < 1-69. 


9.50 n i = 8, ri 2 = 13, x\ = 1.98, x^ = 1.30, si = 0.51, S 2 = 0.35, and s p = 0.416. to.025 = 
2.093 with 19 degrees of freedom. So, 

(1.98 - 1.30) ± (2.093)(0.416)Vl/8 + 1/13 = 0.68 ± 0.39, 


which yields 0.29 < pi — p 2 < 1.07. 

9.51 (a) n = 200, p = 0.57, q = 0.43, and z 0 . 02 = 2.05. So, 

0.57 ± (2.05)\/^ ^^ ,43) = 0.57 ± 0.072, 

V zuu 

which yields 0.498 < p < 0.642. 

(b) Error < (2.05)^^^ = 0.072. 

9.52 n = 500.p = = 0.97, q = 0.03, and 2 0 .05 = 1-645. So, 


0.97 ± (1.645) 


(0.97)(0.03) 

500 


0.97 ±0.013, 


which yields 0.957 < p < 0.983. 

9.53 n = 1000, p = ^ = 0.228, q = 0.772, and ^.005 = 2.575. So, 


0.228 ± = 0.228 ± 0.034, 

which yields 0.194 < p < 0.262. 


9.54 n = 100, p = y§o = 0-08, q = 0.92, and zq . 01 = 2.33. So, 


0.08 ± (2.33) 


(0.08)(0.92) 


0.08 ±0.063, 


which yields 0.017 < p < 0.143. 


100 
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9.55 (a) n = 40, p = || = 0.85, q = 0.15, and 20.025 = 1.96. So, 


0.85 ± (1.96) 


(0.85)(0.15) 

40 


0.85 ±0.111, 


which yields 0.739 < p < 0.961. 

(b) Since p = 0.8 falls in the confidence interval, we can not conclude that the new 
system is better. 


9.56 n = 100, p = ^ = 0.24, q = 0.76, and 2 0 .oo 5 = 2.575. 


(a) 0.24 ± (2.575) yj l 0 - 24 ^ 0 - 76 ) = 0.24 ± 0.110, which yields 0.130 < p < 0.350. 

(b) Error < (2.575)^^^= 0.110. 

9.57 n = 1600 , p = |,g = |, and 20.025 = 1-96. 

(a) | ± (1.96) yj (2/ 1 3 ^ 1 0 /3) = | ± 0.023, which yields 0.644 < p < 0.690. 

(b) Error < (1.96)y/^g^= 0.023. 

9.58 n = (1,%) ( o n 0 o^ )( °' (,S) = 2090 when round up. 

9.59 n = = 2576 when round up. 

9.60 „ = (2 - 575)2 ( ( 0 ° 0 2 g )( °- 772) = 467 when round up. 

9.61 n = ^ 2.33) ^ 0 . 08 ) ( 0 . 02 ) _ ^gg w J len rounc [ U p. 

9.62 n = ( 4 ^o op 2 = 9604 when round up. 

9.63 n = ^ 4 ))q 7 qp 2 = 16577 when round up. 

9.64 n — ^ 4^0 04^2 =601 when round up. 

9.65 tim = np = 1000, pm = 0.250, qM = 0.750, p F = 0.275, q F = 0.725, and 20.025 = 1-96. 
So 


(0.275 


0.250) ± ( i, 9 6)./(°- 250 )(°- 75 °) + (0.275X0.725) 

; v ; V 1000 1000 


0.025 ±0.039, 


which yields —0.0136 < p F — p M < 0.0636. 
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9.66 m = 250 , n 2 = 175, p 1 = ^ = 0.32, p 2 = ^ = 0.2286, and z om = 1-645. So, 


(0.32 


. , / (0.32) (0.68) (0.2286)(0.7714) 

0.2286) ± (1.645) a/ ^ -± v A ; 


250 


175 


0.0914 ±0.0713, 


which yields 0.0201 < p\ — p 2 < 0.1627. From this study we conclude that there is 
a significantly higher proportion of women in electrical engineering than there is in 
chemical engineering. 

9.67 ri\ = ri2 = 500, pi = = 0.24, p 2 = = 0.196, and 20.05 = 1.645. So, 


(0.24 - 0.196) ±(1.645) 


(0.24)(0.76) (0.196)(0.804) 


500 


500 


0.044 ± 0.0429, 


which yields 0.0011 < Pi — p 2 < 0.0869. Since 0 is not in this confidence interval, we 
conclude, at the level of 90% confidence, that inoculation has an effect on the incidence 
of the disease. 


9.68 n 5 o C = m 5 o C = 20,p 5 o C = 0.50,pi 5 ° c = 0.75, and z 0 , 025 = 1.96. So, 


(0.5 


0.75) ± (1.96) 


(0.50)(0.50) (0.75)(0.25) 


20 


20 


-0.25 ±0.2899, 


which yields —0.5399 < p 5 o C — p 15 o C < 0.0399. Since this interval includes 0, the 
significance of the difference cannot be shown at the confidence level of 95%. 


9.69 n now = 1000, p now = 0.2740, n 9 i = 760, p 9 i = 0.3158, and 20.025 = 1-96. So, 

(0.2740 - 0.3158) ± = -0.0418 ± 0.0431, 

v ; v ’ V 1000 760 

which yields —0.0849 < p now — pgi < 0.0013. Hence, at the confidence level of 95%, 

the significance cannot be shown. 

9.70 ngo = n 9 4 = 20,p 9 o = 0.337, and 694 = 0.362 


9.71 


(a) n 9 op 9 o = (20) (0.337) ~ 7 and n 9 4 p 9 4 = (20) (0.362) ~ 7. 

(b) Since 20.025 = 1-96, ( 0 . 337 - 0.362) ± (1.96) ^°- 337 K°- 663) + (o.362)(o,638) = _ 0 .025± 
0.295, which yields —0.320 < pg 0 — pg 4 < 0.270. Hence there is no evidence, at 
the confidence level of 95%, that there is a change in the proportions. 

s 2 = 0.815 with v — 4 degrees of freedom. Also, X 0.025 = H-143 and X 0.975 = 0.484. 


So, 


(4X0.815) ^ (4X0.815) 


which yields 0.293 < u 2 < 6.736. 


11.143 0.484 

Since this interval contains 1, the claim that a 2 seems valid 
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9.72 


9.73 


9.74 


9.75 


9.76 


9.77 


9.78 


9.79 


s 2 = 16 with v = 19 degrees of freedom. It is known Xo.oi = 36.191 and X0.99 — 7.633. 
Hence 

(19)(16) 2 (19)(16) 2 

; < a 2 < v VA V , or 8.400 < a 2 < 39.827. 


36.191 


7.633 


s 2 = 6.0025 with v = 19 degrees of freedom. Also, X'0.025 = 32.852 and X0.975 = 8-907. 
Hence, 

(19)(6.0025) 2 (19)(6.0025) _ 2 

v A ; < a 2 < v — -, or 3.472 < a 2 < 12.804, 


32.852 


8.907 


s 2 = 0.0006 with v — 8 degrees of freedom. Also, X0.005 = 21.955 and X0.995 = 1-344. 
Hence, 

(8)(0.0006) 2 (8)(0.0006) 2 

VA < a < y A „ , . or 0.00022 < a 2 < 0.00357. 


21.955 


1.344 


s 2 = 225 with v = 9 degrees of freedom. Also, X0.005 = 23.589 and X0.995 = 1-735. 
Hence, 

(9)(225) 2 (9) (225) 2 

IS W <a or 85.845 < a 2 < 1167.147, 

which yields 9.27 < a < 34.16. 

s 2 = 2.25 with v = 11 degrees of freedom. Also, Xo.w, = 19.675 and X0.95 = 4.575. 
Hence, 

(11)(2.25) 2 (11) (2.25) 2 

V A ’ <a 2 < K \ or 1.258 < a 2 < 5.410. 


19.675 


4.575 


sf = 1.00, s 2 2 = 0.64, /o.oi(H, 9) = 5.19, and f 0 . 01 (9, 11) = 4.63. So, 

1,0 r °y 4 < % < (1.00/0.64)(4.63), or 0.301 < % < 7.234, 

5.19 ol cr£ 

which yields 0.549 < ^ < 2.690. 

J cr 2 

s 2 = 5000 2 ,s| = 6100 2 , and /o.o5(H, 11) = 2.82. (Note: this value can be found by 
using “=finv(0.05,ll,ll)” in Microsoft Excel.) So, 


5000\ 2 1 . a 2 75000V 


6100 ) 2.82 " a 2 


< -k < 


V6100 y 


(2.82), or 0.238 < ^ < 1.895. 


crs 


Since the interval contains 1, it is reasonable to assume that erf — o\. 
sj = 76.3, V/ = 1035.905, /o. 05 (4, 6) = 4.53, and / 0 . 05 ( 6,4) = 6.16. So, 


76.3 


1035.905 / \ 4.53 


a 2 

<^r< 

°n 


76.3 


1035.905 


07 


(6.16), or 0.016 < -f < 0.454. 


a 


11 


Hence, we may assume that erf 7^ erfj 
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9.80 sa = 0.7794, sb = 0.7538, and /o.o25(14, 14) = 2.98 (Note: this value can be found by 
using “=finv(0.025,14,14)” in Microsoft Excel.) So, 


0.7794 \ 2 / 1 \ o\ ( 0.7794\ 
0.7538 J V^98/ < of < V 0 - 7538 / 


(2.98), or 0.623 < A < 3.186. 

a B 


Hence, it is reasonable to assume the equality of the variances. 


9.81 The likelihood function is 

n n 

L(xi, ...,x n ) =]\f(x i -,p) = l[p Xi (l -p) 1 -** = p n ~ x (l-p) n ^~~ x \ 

2=1 2=1 

Hence, hi L = n[xln(p) + (1 — x)ln(l — p)]. Taking derivative with respect to p 
and setting the derivative to zero, we obtain — n ~ T^f) = *1 which yields 

- — 7 — ^ = 0. Therefore, p = x. 

p 1 —p ’ £ 

9.82 (a) The likelihood function is 


L{x i, . . . , x n ) = f(xi ; a, (3) = ( a(3) n JJ xf 1 


" e aa 4 


2=1 


A / 3 

—a > x / — r 

= (aB)"e - H 


2=1 

n \ y - 1 
Xi 

1=1 


(b) So, the log-likelihood can be expressed as 


InL = n[ln(a) + ln(/?)] — a + (p- 1 )^ ln(xj). 

2=1 2=1 


To solve for the maximum likelihood estimate, we need to solve the following two 
equations 


<9 In L 
da 


0 , and 


<9 In L 

d(3 


0 . 


9.83 (a) The likelihood function is 


L(x i, , x n ) = JJ f(xi ; p, a) = JJ 

2=1 2=1 

1 

= n eX P 

(27r) n / 2 cr n H Xi 
2=1 


\/2ni 


[In (xj)-IJ} 

2CJ 1 


7T CXi 


1 

2^2 


^[ln(xi) -p] 2 
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(b) It is easy to obtain 


n -j n 

In L = ~ ln(2vr) - ^ ln(u 2 ) - ^ Info) - — ^[ln(x,) - pf 


2a 2 

2= 1 i=l 


So, setting 0 = ^ S[^ n ( x *) ~ lAi we obtain ft = Info), and setting 

M a i=i n i=l 

n n 

o = + 2^4 EP n fo) - /i] 2 , we get b 2 = ± £[ ln fo) - A] 2 - 

9.84 (a) The likelihood function is 


2=1 


a— 1 


L(x i, ...,x n ) = 


/3 na T(a) 


nhr'e - '*" 5 ) 


2=1 


/3 nQ r(a) 


ri r 

0 = 1 


- E Ob/b 
e i=1 


(b) Hence 


InL = —na\n(/3) — nln(r(o;)) + (a — 1) Info) — — Xj. 

i= i P i= 1 

Taking derivatives of In L with respect to a and /3, respectively and setting both 
as zeros. Then solve them to obtain the maximum likelihood estimates. 

9.85 L(x) = p x ( 1 — pY~ x , and InL = x ln(p) + (1 — x) ln(l — p), with | — jfo = 0, 

we obtain p = x = 1. 

9.86 From the density function b*(x;p) = (?lj)p fe (l — p) x ~ k , we obtain 

hr - 1\ 

In L — In ( ) + k In p + (n — k ) ln(l — p). 

k — 1 ) 


Setting diiiT = k _ = o, we obtain p=K 

° ap p 1—p ’ x n 

9.87 For the estimator S' 2 , 


Var(S 2 ) = 


(n - 1) : 


-V ar 


E 1 

2=1 


Xn ~ X) 


(n - 1) : 


-Var{a Xn -i 


1 4 r / , n 2 a 4 

7 — cr 4 2(n - 1) = 

(n - l) 2 L v n n- 1 




rr 


For the estimator a 2 , we have 



Solutions for Exercises in Chapter 9 


115 


9.88 


9.89 


9.90 


9.91 


9.92 


9.93 


9.94 


n — 7, d — 3.557, Sd = 2.776, and to.025 = 2.447 with 6 degrees of freedom. So, 


3.557 ± (2.447) 


2.776 

~7T 


3.557 ±2.567, 


which yields 0.99 < po < 6.12. Since 0 is not in the interval, the claim appears valid. 

n = 75, x = 28, hence p = || = 0.3733. Since 20.025 = 1.96, a 95% confidence interval 
for p can be calculate as 


0.3733 ± (1.96) 


(0.3733)(0.6267) 

75 


0.3733 ±0.1095, 


which yields 0.2638 < p < 0.4828. Since the interval contains 0.421, the claim made 
by the Roanoke Times seems reasonable. 


n — 12, d, — 40.58, Sd = 15.791, and to.025 = 2.201 with 11 degrees of freedom. So, 

15 791 

40.58 ± (2.201) — = 40.58 ± 10.03, 

v 12 

which yields 30.55 < po < 50.61. 

n = 6, d = 1.5, Sd = 1.543, and to.025 = 2.571 with 5 degrees of freedom. So, 

, . 1.543 

1.5 ± (2.571)— pr- = 1.5 ± 1.62, 
v6 

which yields —0.12 < pr, < 3.12. 

n — 12, d — 417.5, Sd = 1186.643, and to.05 = 1.796 with 11 degrees of freedom. So, 

417.5 ± (1.796) 118 ^ 43 = 417.5 ± 615.23, 
v 12 

which yields —197.73 < po < 1032.73. 


rip = n v = 8,x p = 86, 250.000, x u = 79, 837.500, a p = a u = 4,000, and 2:0.025 = 1-96. 
So, 

(86250 - 79837.5) ± (1.96)(4000)</L/8 ± 1/8 = 6412.5 ± 3920, 

which yields 2,492.5 < p p — p u < 10,332.5. Hence, polishing does increase the average 
endurance limit. 


riA = 100 , ub = 120 ,p A = ^5 = 0.24, p B = ^ = 0.30, and 20.025 = 1-96. So, 


(0.30- 0.24) ± (1.96) '/±M|W+ C0-30)C0.70) 


100 


120 


0.06 ±0.117, 


which yields —0.057 < Pb ~ Pa < 0.177. 
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9.95 n N = n 0 = 23, s% = 105.9271,4 = 77.4138, and f om 5 (22,22) = 2.358. So, 


105.9271 1 


< 


<x 


N 


77.4138 2.358 a\ 


< 


o 


105.9271 

77.4138 


(2.358), or 0.58 < 


N 


< 3.23. 


o 


For the ratio of the standard deviations, the 95% confidence interval is approximately 

0.76 < — < 1.80. 
ao 

Since the intervals contain 1 we will assume that the variability did not change with 
the local supplier. 

9.96 tia = ub = 6,xa = 0.1407, xb = 0.1385, sa = 0.002805, sb = 0.002665, and s p = 
002736. Using a 90% confidence interval for the difference in the population means, 
to . 05 = 1.812 with 10 degrees of freedom, we obtain 

(0.1407 - 0.1385) ± (1.812) (0.002736) \A/6 + 1/6 = 0.0022 ± 0.0029, 

which yields —0.0007 < Pa—^b < 0.0051. Since the 90% confidence interval contains 0, 
we conclude that wire A was not shown to be better than wire H, with 90% confidence. 


9.97 To calculate the maximum likelihood estimator, we need to use 


In L 


( 


In 


E x 

e n/i /n =1 


. n %i'- 

\ i=i 


A 


n n 

-rip + ln(/i) x i ~ ln(J^Xj!). 

i=l i= 1 


Taking derivative with respect to p and setting it to zero, we obtain (x — ^ ^ Xi — x. 

i= 1 

On the other hand, using the method of moments, we also get p — x. 

n 

9.98 p = x and a 2 = 4i J2( x i ~ x ) 2 - 

i= 1 

9.99 Equating x = e At+fj2 / 2 and s 2 = (e 2/i+cr2 )(e CT2 — 1), we get ln(x) = p+ or p = ln(x) — 

On the other hand, ln.s 2 = 2p + a 2 + ln(e CT “ — 1). Plug in the form of p, we obtain 

* 2 = i"(i + #)' 

9.100 Setting x = a/3 and s 2 = a(3 2 : we get a — 4? and /3 = y. 


9.101 n i = n 2 = 300, X\ = 102300, x 2 = 98500, si = 5700, and s 2 = 3800. 


(a) 2 o.oo 5 — 2.575. Hence, 


(102300 - 98500) ± (2.575) 


5700 2 

300 


3800 2 
+ 300 


3800 ± 1018.46, 


which yields 2781.54 < p\ — p 2 < 4818.46. There is a significant difference in 
salaries between the two regions. 
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(b) Since the sample sizes arc large enough, it is not necessary to assume the normality 
due to the Central Limit Theorem. 

(c) We assumed that the two variances are not equal. Here we are going to obtain 
a 95% conhdence interval for the ratio of the two variances. It is known that 
/o.o2 5 (299,299) = 1.255. So, 


5700 

3800 


1 ^ of / 5700\ 

1.255 < of < V38007 


(1.255), or 1.793 < % < 2.824. 

Cr 2 


Since the conhdence interval does not contain 1, the difference between the vari- 
ances is significant. 


9.102 The error in estimation, with 95% conhdence, is (1. 
tity to 1000, we obtain 

( l-96)(4000) \ 2 
1000 ) 


Equating this quan- 


n — 2 


when round up. Hence, the sample sizes in Review Exercise 9.101 is sufficient to 
produce a 95% conhdence interval on pi — p 2 having a width of $1,000. 


9.103 n = 300, x = 6.5 and s = 2.5. Also, 1 — a — 0.99 and 1 — 7 = 0.95. Using Table A. 7, 
k = 2.522. So, the limit of the one-sided tolerance interval is 6.5+ (2.522) (2.5) = 12.805. 
Since this interval contains 10, the claim by the union leaders appears valid. 


9.104 n = 30, £ = 8, and £0.025 = 1-96. So, 


15 V 30 

which yields 0.108 < p < 0.425. 

9.105 n = (1 ” |2 + 1 g (11/15) = 301, when round up, 

9.106 rii = n 2 = 100, pi = 0.1, and p 2 = 0.06. 

(a) £0.025 = 1.96. So, 


= —±0.158, 
15 


(0.1 


0.06) ± (1.96) 


(0.1)(0.9) (0.06)(0.94) 


100 


100 


0.04 + 0.075, 


which yields —0.035 < Pi — P 2 < 0.115. 

(b) Since the conhdence interval contains 0, it does not show sufficient evidence that 
Pi > Vi- 
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9.107 n = 20 and s 2 = 0.045. It is known that X 0.025 = 32.825 and X 0.975 = 8-907 with 19 
degrees of freedom. Hence the 95% confidence interval for cr 2 can be expressed as 


(19)(0.045) 2 (19)(0.045) 

32.825 ° 8.907 


or 0.012 <a 2 < 0.045. 


Therefore, the 95% confidence interval for a can be approximated as 


0.110 < a < 0.212. 


Since 0.3 falls outside of the confidence interval, there is strong evidence that the 
process has been improved in variability. 

9.108 riA = ns = 15, 7/a = 87, sa = 5.99, ys = 75, sb = 4.85, s p = 5.450, and £ 0.025 = 2.048 
with 28 degrees of freedom. So, 


(87- 75) ± (2.048)(5.450) 



12 ±4.076, 


which yields 7.924 < /ua — Tb < 16.076. Apparently, the mean operating costs of type 
A engines are higher than those of type B engines. 


9.109 Since the unbiased estimators of erf and cr 2 are Sf and S 2 , respectively, 

ZP/C 2 N 1 i 7 „ , /„ 1 \ z 7 v c» 2 m (ri! - l)a 2 + (n 2 - l)a 2 

E(S > = n, + »2 — 2 ni " 1)£(Sl) + ( " 2 “ 1)E(S ’ )] = n[ + re -2 — 2 ' 

If we assume erf = o\ = a 2 , the right hand side of the above is cr 2 , which means that 
S 2 is unbiased for a 2 . 


9.110 n — 15, a; = 3.2, and s = 0.6. 

(a) to.oi = 2.624 with 14 degrees of freedom. So, a 99% left-sided confidence interval 
has an upper bound of 3.2 ± (2.624)^== = 3.607 seconds. We assumed normality 
in the calculation. 

(b) 3.2 ± (2.624) (0.6 ) yjl ± A — 4.826. Still, we need to assume normality in the 
distribution. 

(c) 1 — a = 0.99 and 1 — 7 = 0.95. So, k = 3.520 with n = 15. So, the upper bound 
is 3.2 ± (3.520) (0.6) = 5.312. Hence, we are 99% confident to claim that 95% of 
the pilot will have reaction time less than 5.312 seconds. 

9.111 n = 400, x = 17, so p = ^ = 0.0425. 

(a) 2:0.025 = 1 - 96 . So, 

0,0425 ± (1.96) /(°-M25)(0.9575) = 
v ; V 400 


which yields 0.0227 < p < 0.0623. 
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(b) 2 o.o 5 = 1.645. So, the upper bound of a left-sided 95% confidence interval is 
0.0425 + (1.645) 0 ^ 25 4 ) o ( o °- 9575) = 0.0591. 

(c) Using both intervals, we do not have evidence to dispute suppliers’ claim. 




Chapter 10 

One- and Two-Sample Tests of 
Hypotheses 


10.1 (a) Conclude that fewer than 30% of the public are allergic to some cheese products 

when, in fact, 30% or more are allergic. 

(b) Conclude that at least 30% of the public are allergic to some cheese products 
when, in fact, fewer than 30% are allergic. 

10.2 (a) The training course is effective. 

(b) The training course is effective. 

10.3 (a) The firm is not guilty. 

(b) The firm is guilty. 

10.4 (a) a = P(X < 5 | p = 0.6) + P(X > 13 | p = 0.6) = 0.0338 + (1 - 0.9729) = 0.0609. 

(b) /3 = P(6 < X < 12 | p = 0.5) = 0.9963 - 0.1509 = 0.8454. 

P = P(6 < X < 12 j p = 0.7) = 0.8732 - 0.0037 = 0.8695. 

(c) This test procedure is not good for detecting differences of 0.1 in p. 

10.5 (a) a = P(X < 110 | p = 0.6) + P(X > 130 | p = 0.6) = P(Z < -1.52) + P(Z > 

1.52) = 2(0.0643) = 0.1286. 

(b) p = P(110 < X < 130 | p = 0.5) = P(1.34 < Z < 4.31) = 0.0901. 

p = P(110 < X < 130 j p = 0.7) = P(— 4.71 < Z < -1.47) = 0.0708. 

(c) The probability of a Type I error is somewhat high for this procedure, although 
Type II errors are reduced dramatically. 

10.6 (a) a = P(X < 3 | p = 0.6) = 0.0548. 

(b) p = P(X > 3 | p = 0.3) = 1 - 0.6496 = 0.3504. 

P = P(X > 3 j p = 0.4) = 1 - 0.3823 = 0.6177. 

P = P(X > 3 j p = 0.5) = 1 - 0.1719 = 0.8281. 
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10.7 (a) a = P(X < 24 | p = 0.6) = P(Z < -1.59) = 0.0559. 

(b) p = P(X > 24 | p = 0.3) = P(Z > 2.93) = 1 - 0.9983 = 0.0017. 

p = P(X > 24 | p = 0.4) = P(Z > 1.30) = 1 - 0.9032 = 0.0968. 

p = P(X > 24 | p = 0.5) = P(Z > -0.14) = 1 - 0.4443 = 0.5557. 

10.8 (a) n =12 ,p = 0.7, and a = P(X > 11) = 0.0712 + 0.0138 = 0.0850. 

(b) n =12 ,p = 0.9, and p = P(X < 10) = 0.3410. 

10.9 (a) n = 100, p = 0.7, // = rip — 70, and o = y Tvpq = a / (100)(0.7)(0.3) = 4.583. 

Hence z = 8 ^g~ 3 70 = 0.3410. Therefore, 

a = P(X > 82) = P(Z > 2.73) = 1 - 0.9968 = 0.0032. 

(b) n = 100, p = 0.9, fi — np = 90, and o = y/npq = \J (100)(0.9)(0.1) = 3. Hence 
z = -2.5. So, 

P = P(X < 82) = P(X < -2.5) = 0.0062. 
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10.14 


10.15 


10.16 


10.17 


So 


a = 2 P(Z < -2.60) = (2)(0.0047) = 0.0094. 


Also, from Exercise 10.12(b) we have /i = 192 and o = 9.992, with 


Zl 


214.5 - 192 

= 2.25, 

9.992 


and z 2 


265.5 - 192 
9.992 


7.36. 


Therefore, 


p = P{ 2.25 < Z < 7.36) = 1 - 0.9878 = 0.0122. 


(a) n — 50, p — 15, a = 0.5, and oxj = = 0.071, with z = 14 Q 9 071 15 = —1.41. 

Hence, a = P(Z < —1.41) = 0.0793. 

(b) If p = 14.8, 2 = 14 0 9 ~ 7 \ 4 - 8 = 1.41. So, p = P(Z > 1.41) = 0.0793. 

If p = 14.9, then z — 0 and P = P(Z > 0) = 0.5. 

(a) n = 200, n — 9, a — 15 and <7x = t = So, 

191 - 200 , 209 - 200 

zi = = —1.8, and z 2 = = 1.8, 

5 5 

with a = 2 P(Z < -1.8) = (2)(0.0359) = 0.0718. 

(b) If p = 215, then z-l = = -4.8 and = -1.2, with 

p = P(— 4.8 < Z < -1.2) = 0.1151 - 0 = 0.1151. 

(a) When n = 15, then = y = 3, with /i = 200 and n = 25. Hence 

191 - 200 0 , 209 - 200 0 

zi = = -3, and ^ = 3, 

with a = 2 P(Z < -3) = (2)(0.0013) = 0.0026. 

(b) When n = 215, then .3 - 1 - = -8 and z 2 = ^215 _ _ 2 , wit h 

p = p(_ 8 < Z < -2) = 0.0228 - 0 = 0.0228. 

(a) n = 50, p = 5000, cr = 120, and 0 % = = 16.971, with 3 = 49 ^~^ 00 = -1.77 

and a = P(Z < -1.77) = 0.0384. 

(b) If n — 4970, then z = 0 and hence P = P(Z > 0) = 0.5. 

If fi = 4960, then 3 = 49 g~ 4 f° = 0.59 and p = P{Z > 0.59) = 0.2776. 


10.18 The OC curve is shown next. 
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OC curve 



10.19 The hypotheses are 


H 0 : p = 800, 

H x : fi ± 800. 

Now, 2 = -1.64, and P-v alue= 2 P(Z < -1.64) = (2) (0.0505) = 0.1010. 

Hence, the mean is not significantly different from 800 for a < 0.101. 

10.20 The hypotheses are 


Hq : p = 5.5, 

Hi : p < 5.5. 

Now, z = = —9.0, and P-value— P(Z < —9.0) ~ 0. The White Cheddar 

Popcorn, on average, weighs less than 5.5oz. 

10.21 The hypotheses are 

H 0 : p = 40 months, 

H\ : p < 40 months. 

Now, z = — —2.76, and P-value= P(Z < —2.76) = 0.0029. Decision: reject 

Po- 

10. 22 The hypotheses are 


Hq : p = 162.5 centimeters, 

H i : /i / 162.5 centimeters. 

Now, * = 16 6 5 9 2 / -^g- 5 = 2.77, and P- valuer 2 P(Z > 2.77) = (2) (0.0028) = 0.0056. 
Decision: reject H 0 and conclude that p ^ 162.5. 
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10.23 The hypotheses are 

Hq : /i = 20, 000 kilometers, 

Hi : p > 20, 000 kilometers. 

Now, z = 2 ' 39 oo/^o° 0 00 — 8-97, and P-value= P(Z > 8.97) ~ 0. Decision: reject H 0 and 
conclude that p ^ 20, 000 kilometers. 

10.24 The hypotheses are 

H 0 : p = 8, 

H\ : fi > 8. 

Now, z = - = 3.33, and P-value= P(Z > 3.33) = 0.0004. Decision: Reject H 0 
and conclude that men who use TM, on average, mediate more than 8 hours per week. 

10.25 The hypotheses are 


H 0 ■ 0 = 10, 

Hi : p ^ 10 . 


a = 0.01 and df = 9. 

Critical region: t < —3.25 or t > 3.25. 
Computation: t = = 0.77. 

Decision: Fail to reject Hq. 


10.26 The hypotheses are 


Ho : n = 220 milligrams, 
H i : fi > 220 milligrams. 


a = 0.01 and df = 9. 

Critical region: t > 1.729. 

Computation: t = = 4.38. 

Decision: Reject Ho and claim p > 220 milligrams. 

10.27 The hypotheses are 


Ho '■ dl ~ d2i 
H\ : // 1 > 


Since s r = = 10.35, then 


P 


T > 


34.0 


10.35^/1/30 + 1/30 


= P(Z > 12.72) « 0. 


Hence, the conclusion is that running increases the mean RMR in older women. 
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10.28 The hypotheses are 


Hq : pc — pAi 
Hi '■ Pc > Pa , 


with s p = = x 3807 We obtain t = 20 . 0 — 12.0 = 20 . 48 . Since 

p V 48 1.3807^/2/25 

P{T > 20.48) ~ 0, we conclude that the mean percent absorbency for the cotton fiber 
is significantly higher than the mean percent absorbency for acetate. 

10.29 The hypotheses are 


Hq : p = 35 minutes, 
Hi : p < 35 minutes. 


ol = 0.05 and df = 19. 

Critical region: t < —1.729. 

Computation: t = 'j 3 ^ || = —1.98. 

Decision: Reject Hq and conclude that it takes less than 35 minutes, on the average, 
to take the test. 

10.30 The hypotheses are 


Hq : pi — P2, 
Hi : pi ^ p 2 - 


Since the variances are known, we obtain z 
and we conclude that p\ > p 2 ■ 


81-76 

a/5.2 2 /25+3.5 2 /36 


4.22. So, P-value~ 0 


10.31 The hypotheses are 


H 0 : pa — Pb — 12 kilograms, 
Hi : pa — Pb > 12 kilograms. 


a = 0.05. 

Critical region: z > 1.645. 

Computation: z = ( 86 . 7 - m . 8)-12 = _ _ 2 .60. So, fail to reject H 0 and conclude that 

1 yj (6.28) 2 /50+(5.61) 2 /50 ’ J 

the average tensile strength of thread A does not exceed the average tensile strength 
of thread B by 12 kilograms. 


10.32 The hypotheses are 


Hq : pi — p-2 — $2, 000, 

Hi : pi — P 2 Z> $2, 000. 
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a = 0 . 01 . 

Critical region: z > 2.33. 

Computation: * = ( 70750 - 652 oo)- 2 ooq = g 43 with a p_ value= p(z > 6.43) « 

(6000) 2 /200+(5000) 2 /200 v ' 

0. Reject H 0 and conclude that the mean salary for associate professors in research 
institutions is $2000 higher than for those in other institutions. 

10.33 The hypotheses are 

Hq : pi — /i 2 = 0.5 micromoles per 30 minutes, 

Hi : pi — fi 2 > 0.5 micromoles per 30 minutes. 


a = 0 . 01 . 

Critical region: t > 2.485 with 25 degrees of freedom. 

Computation: si = (11)(1 " J ) _ 1.3936 and t = (*■*>- _ 3.59 . d q 

V v 25 vT8936 a/1/15+1/12 

not reject H 0 . 

10.34 The hypotheses are 


Ho : pi — /i2 — 8, 

Hi : pi - fi 2 < 8 . 

Computation: s 2 v = = 3L3g5 and t = = _ Q g2 

^ p 26 ’ %/3L395yh/ll+l/17 

Using 28 degrees of freedom and Table A. 4, we obtain that 0.15 < P - value < 0.20. 
Decision: Do not reject Hq. 

10.35 The hypotheses are 


Hq : Hi — E 2 — 0 , 

Hi : pi — H 2 < 0 . 


a = 0.05 

Critical region: t < —1.895 with 7 degrees of freedom. 

Computation: s p = sj {3)(1 -363)+(4) (3 1 883)_ = Lg74> and t = _ +075-2^860^ = _ 0 .70. 
Decision: Do not reject Ho. 

10.36 The hypotheses are 


Ho '■ Ei — h2, 
H\ : // 1 ^ /i 2 . 


Computation: s p = ^/ 5 10 ° 2 + 590 ° 2 = 55 1 5) and t = = -0.84. 

Using 22 degrees of freedom and since 0.20 < P{T < —0.84) < 0.3, we obtain 0.4 < 
P-value < 0.6. Decision: Do not reject H 0 . 
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10.37 The hypotheses are 

H 0 : pi — P 2 = 4 kilometers, 

H\ : pi — p 2 A 4 kilometers. 

a = 0.10 and the critical regions are t < —1.725 or t > 1.725 with 20 degrees of 
freedom. 

Computation: t = 5 T 1 = 2.55. 

(0.915) + 1/12+1/10 

Decision: Reject H 0 . 

10.38 The hypotheses are 


Ho '■ hi — P 2 — 8 , 

Hi : pi - p 2 < 8. 


a = 0.05 and the critical region is t < —1.714 with 23 degrees of freedom. 
Computation: s p = . A 9 H 3 - 2 ) +(i4)(2.8) _ 2.963, and t = 7 5 ~ 8 = —2.07. 

^ P V 23 ’ 2.963^/1/10+1/15 

Decision: Reject H 0 and conclude that pi — /i 2 < 8 months. 

10.39 The hypotheses are 


Ho '■ pii — pi — 10, 
H\ : pjj — p./ > 10. 


a = 0 . 1 . 

Degrees of freedom is calculated as 

(78.8/5 + 913.333/7) 2 = 

V (78.8/5) 2 /4 + (913/333/7) 2 /6 ' ’ 

hence we use 7 degrees of freedom with the critical region t > 2.998. 
Computation: t = (iio-o^p-io _ q ^2. 

78.800/5+913.333/7 

Decision: Fail to reject Hq. 

10.40 The hypotheses are 


Ho ■ Ps ~ Pn, 
Hi : Ps A Pn • 


Degrees of freedom is calculated as 

(0.391478 2 /8 + 0.214414 2 /24) 2 _ o 

V ~ (0.391478 2 /8)2/7 + (0.214414 2 /24) 2 /23 “ 

Computation: t = o. 97625 -o. 9 i 583 = _ 0 . 4 2. Since 0.3 < PIT < -0.42) < 0.4, 

y/ 0.391478 2 /8+0.214414 2 /24 v y 

we obtain 0.6 < P-value < 0.8. 

Decision: Fail to reject H 0 . 
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10.41 The hypotheses are 


Hq : pi — p 2 1 

H\ : m ^ /i 2 . 


a = 0.05. 

Degrees of freedom is calculated as 

(7874.329 2 /16 + 2479/503 2 /12) 2 , , 

” = (7S74.329 2 /16) 2 /15 + (2479.503 2 /12) 2 /ll = 19 degleeS ° f freedom ' 

Critical regions t < —2.093 or t > 2.093. 

Computation: t = 989 1.500—1120. 8.33 _ 2 . 76 . 

v /7874.329 2 /16+2479.503 2 /12 

Decision: Reject H 0 and conclude that (i\ > p 2 . 

10.42 The hypotheses are 


Hq : Pi — p 2 , 

H\ : // 1 ^ H'2 ■ 


a = 0.05. 

Critical regions t < —2.776 or t > 2.776, with 4 degrees of freedom. 

Computation: d = —0.1, Sd = 0.1414, so t — 0 = — 

Decision: Do not reject Ho and conclude that the two methods are not significantly 
different. 

10.43 The hypotheses are 


Ho '■ pi — P2, 

H\ : > I-I2 ■ 

Computation: d = 0.1417, Sd = 0.198, t = - = 2.48 and 0.015 < P-value < 0.02 

with 11 degrees of freedom. 

Decision: Reject Ho when a significance level is above 0.02. 

10.44 The hypotheses are 


H 0 : pi — p 2 =4.5 kilograms, 

Hi : pi — p 2 < 4.5 kilograms. 

Computation: d = 3.557, Sd = 2.776, t = | = —0.896, and 0.2 < P-value < 0.3 
with 6 degrees of freedom. 

Decision: Do not reject H 0 . 
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10.45 The hypotheses are 


Ho '■ hi — A*2 ? 

H\ : hi < h2- 

Computation: d = —54.13, Sd = 83.002, t = 83 qq 2+/T5 = — 2.53, and 0.01 < P-value < 
0.015 with 14 degrees of freedom. 

Decision: Reject H 0 . 

10.46 The hypotheses are 


Hq : hi — h2, 
Hi : hi ^ h2- 


a = 0.05. 

Critical regions are t < —2.365 or t > 2.365 with 7 degrees of freedom. 

Computation: d = 198.625, s d = 210.165, t = = 2.67. 

Decision: Reject H 0 ; length of storage influences sorbic acid residual concentrations. 

10.47 n = (L645+1 0 2 g )2(0 - 24)2 = 5.48. The sample size needed is 6. 

10.48 [3 = 0.1, a = 5.8, 5 = 35.9 — 40 = —4.1. Assume a = 0.05 then z 0 , 05 = 1.645, 
z o.io — 1-28. Therefore, 


(1.645 + 1.28) 2 (5.8) 2 


17.12 pa 18 due to round up. 


10.49 1 — P = 0.95 so (3 = 0.05, 5 = 3.1 and z 0 . oi = 2.33. Therefore, 

(1.645 + 2.33) 2 (6.9) 2 


n = 


3.1 2 


= 78.28 ~ 79 due to round up. 


10.50 (3 = 0.05, 5 = 8, a = 0.05, z 0 . 05 = 1.645, cy = 6.28 and cr 2 = 5.61. Therefore, 
(1.645 + 1.645) 2 (6.28 2 + 5.61 2 ^ 


n = 


8 2 


= 11.99 ~ 12 due to round up. 


10.51 n = L(>1 [ > (t 0 2) 8 (2 2 25)p 25) = 4-29. The sample size would be 5. 


10.52 a = 1.25, a = 0.05, (3 = 0.1, <5 = 0.5, so A = = 0.4. Using Table A. 8 we find 

n = 68. 


Hq '■ M hot — M co i d — 0, 

Hi : M hot - M cold ^ 0. 


10.53 (a) The hypotheses are 
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(b) Use paired T-test and find out t = 0.99 with 0.3 < P-value < 0.4. Hence, fail to 
reject H 0 . 

10.54 Using paired T-test, we find out t = 2.4 with 8 degrees of freedom. So, 0.02 < 
P-value < 0.025. Reject Ho] breathing frequency significantly higher in the presence 
of CO. 


10.55 The hypotheses are 

H 0 : p = 0.40, 

Hi : p > 0.40. 

Denote by X for those who choose lasagna. 

P-value = P(X > 9 | p = 0.40) = 0.4044. 

The claim that p = 0.40 is not refuted. 

10.56 The hypotheses are 

Hq ■ p — 0.40, 

Hi : p > 0.40. 

a = 0.05. 

Test statistic: binomial variable X with p = 0.4 and n = 15. 

Computation: x = 8 and npo = (15) (0.4) = 6. Therefore, from Table A.l, 

P-value = P(X > 8 | p = 0.4) = 1 - P(X <7 \p = 0.4) = 0.2131, 

which is larger than 0.05. 

Decision: Do not reject Ho. 

10.57 The hypotheses are 

Ho : p = 0.5, 

Hi:p< 0.5. 

P-value = P(X < 5 | p = 0.05) = 0.0207. 

Decision: Reject H 0 . 

10.58 The hypotheses are 

Ho ■ P = 0.6, 

Hi : p < 0.6. 

So 


10 - ( 200 )( 0 . 6 ) 
/ (200)(0.6)(0.4) 


= P(Z < -1.44) = 0.0749. 


Decision: Fail to reject H 0 . 
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10.59 The hypotheses are 


H 0 :p = 0.2, 

Hi : p < 0.2. 


Then 


P - value « P 



136- (1000)(0.2) \ 

(1000) (0.2) (o^iy J 


P(Z 


Decision: Reject Hy less than 1/5 of the homes in the city 

10.60 The hypotheses are 


< -5.06) « 0. 
are heated by oil. 


H 0 : p = 0.25, 
Hi : p > 0.25. 


a = 0.05. 
Computation: 


P-value ~ P Z > 


28 — (90)(0.25) 


= P(Z > 1,34) = 


y/ (90)(0.25)(0.75)^ 

Decision: Fail to reject H 0 ; No sufficient evidence to conclude that p 


0.091. 
> 0.25. 


10.61 The hypotheses are 


Ho : p = 0.8, 

Hi : p > 0.8. 


a = 0.04. 

Critical region: z > 1.75. 

Computation: z = 2 -5°-( 300 H°- 8 > — 1,44, 

^/( 300 )( 0 . 8 )( 0 . 2 ) 

Decision: Fail to reject H 0 ; it cannot conclude that the new missile system is more 
accurate. 


10.62 The hypotheses are 


H 0 : p = 0.25, 
Hi : p > 0.25. 


a = 0.05. 

Critical region: z > 1.645. 

Computation: z = 1(, ~ ( _ 1.333. 

\J ( 48 )( 0 . 25 )( 0 . 75 ) 

Decision: Fail to reject Ho- On the other hand, we can calculate 


P-value = P(Z > 1.33) = 0.0918. 
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10.63 The hypotheses are 


H 0 ■ Pi = P2, 

Hi : pi ^ p 2 . 


Computation: p = 


63+59 


= 0.5422, 


z = 


(63/100) — (59/125) 


\J (0.5422)(0.4578)(1/100+1/125) 


= 2.36, with 


100+125 

P-value = 2 P(Z > 2.36) = 0.0182. 

Decision: Reject Hq at level 0.0182. The proportion of urban residents who favor the 
nuclear plant is larger than the proportion of suburban residents who favor the nuclear 
plant. 


10.64 The hypotheses are 


H 0 ■ Pi = P-2, 

Hi : pi > p 2 . 


(240/300) — (288/400) 
(0.7543)(0.2457)(1/300+1/400) 


= 2.44, with 


Computation: p = 3 qq^ 4 qq = 0.7543, z = 

P-value = P(Z > 2.44) = 0.0073. 

Decision: Reject Hq. The proportion of couples married less than 2 years and planning 
to have children is significantly higher than that of couples married 5 years and planning 
to have children. 


10.65 The hypotheses are 


H 0 ■ Pu = Pr, 
Hi : Pu > Pr- 


Computation: p = 


20+10 


= 0.085714, z = 


(20/200)- (10/150) 


= 1.10, with 


200+150 ’ ’ (0.085714) (0.914286) (1/200+1/150) 

P-value = P(Z > 1.10) = 0.1357. 

Decision: Fail to reject Ho. It cannot be shown that breast cancer is more prevalent 
in the urban community. 


10.66 The hypotheses are 


H 0 ■ Pi = P 2 , 

Hi : pi > p 2 . 


Computation: p = ^qI^so = 0-2125, z = 


(29/120) — (56/280)) 


= 0.93, with 


(0.2125)(0.7875)(l/120+l/280) 

P-value = P(Z > 0.93) = 0.1762. 

Decision: Fail to reject Hq. There is no significant evidence to conclude that the new 
medicine is more effective. 



134 


Chapter 10 One- and Two-Sample Tests of Hypotheses 


10.67 The hypotheses are 

H 0 : a 2 = 0.03, 

H, : a 2 ^ 0.03. 

Computation: y 2 = (9)(0 ^ 0 4 3 585)2 = 18.13. Since 0.025 < P(y 2 > 18.13) < 0.05 with 9 
degrees of freedom, 0.05 < P-value = 2 P(y 2 > 18.13) < 0.10. 

Decision: Fail to reject Ho ; the sample of 10 containers is not sufficient to show that 
a 2 is not equal to 0.03. 

10.68 The hypotheses are 

Ho ■ a = 6, 

Hi : a < 6 . 

Computation: y 2 = (19 ^ 3 4 6 51 - ) = 10.74. Using the table, 1 — 0.95 < P(y 2 < 10.74) < 0.1 
with 19 degrees of freedom, we obtain 0.05 < P-value < 0.1. 

Decision: Fail to reject Ho ; there was not sufficient evidence to conclude that the 
standard deviation is less then 6 at level a = 0.05 level of significance. 

10.69 The hypotheses are 

Ho : cr 2 = 4.2 ppm, 

Hi : a 2 ^ 4.2 ppm. 

Computation: y 2 = — 63.75. Since 0.3 < P(y 2 > 63.75) < 0.5 with 63 

degrees of freedom, P-value = 2P(y 2 > 18.13) > 0.6 (In Microsoft Excel, if you type 
“=2*chidist(63.75,63)”, you will get the P-value as 0.8898. 

Decision: Fail to reject Ho', the variance of aflotoxins is not significantly different from 
4.2 ppm. 

10.70 The hypotheses are 

Ho : cr = 1.40, 

Hi : a > 1.40. 

Computation: y 2 = (11)( [ 1 1 ' ,,) = 17.19. Using the table, 0.1 < P(y 2 > 17.19) < 0.2 
with 11 degrees of freedom, we obtain 0.1 < P-value < 0.2. 

Decision: Fail to reject Ho', the standard deviation of the contributions from the san- 
itation department is not significantly greater than $1.40 at the a = 0.01 level of 
significance. 

10.71 The hypotheses are 

Ho : o 2 = 1.15, 

Hi ■. a 2 > 1.15. 
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Computation: y 2 = (24 ^ 2 1 ° 3 ) = 42.37. Since 0.01 < P(y 2 > 42.37) < 0.02 with 24 
degrees of freedom, 0.01 < P-value < 0.02. 

Decision: Reject P 0 ; there is sufficient evidence to conclude, at level a = 0.05, that 
the soft drink machine is out of control. 

10.72 (a) The hypotheses are 

P 0 : a= 10.0, 

Pi : <4 10.0. 

Computation: z = ^ q^oo = 2-69- So P-value = P(Z < —2.69 )+P(Z > 2.69) = 
0.0072. There is sufficient evidence to conclude that the standard deviation is 
different from 10.0. 

(b) The hypotheses are 

H 0 : a 2 = 6.25, 

Pi : u 2 < 6.25. 


Computation: z = = —1.92. P-value = P(Z < —1.92) = 0.0274. 

Decision: Reject P 0 ; the variance of the distance achieved by the diesel model is 
less than the variance of the distance achieved by the gasoline model. 

10.73 The hypotheses are 

H 0 : af = o\, 

H\ : (j\ > a\. 

Computation: / = = 1.33. Since /o.os ( 10, 13) = 2.67 > 1.33, we fail to reject 

H 0 at level a = 0.05. So, the variability of the time to assemble the product is not 
significantly greater for men. On the other hand, if you use “=fdist(1.33,10,13)”, you 
will obtain the P-value = 0.3095. 

10.74 The hypotheses are 


Ho ■ cr 1 — er 2 , 

Hi : al ^ 4 

Computation: / = [0479 503)2 = 10.09. Since /o.oi(15, 11) = 4.25, the P-value > 
(2) (0.01) = 0.02. Hence we reject Po at level a = 0.02 and claim that the variances 
for the two locations are significantly different. The P-value = 0.0004. 


10.75 The hypotheses are 


H 0 : af = a 2 , 
Hr - 4 
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Computation: / = = 0.086. Since P- value = 2 P(f < 0.086) = (2) (0.0164) = 

0.0328 for 4 and 6 degrees of freedom, the variability of running time for company 1 is 
significantly less than, at level 0.0328, the variability of running time for company 2. 

10.76 The hypotheses are 


Hq : o a — o\b, 
Hi : a A ± a B . 


Computation: / = = 1.15. Since P-value = 2 P(f > 1.15) = (2) (0.424) = 0.848 

for 8 and 8 degrees of freedom, the two instruments appear to have similar variability. 

10.77 The hypotheses are 


H 0 \ Oi — (7 2 , 

H\ \ (j\ ^ cr 2 . 


Computation: / = = 19.67. Since P-value = 2 P(/ > 19.67) = (2) (0.0004) = 

0.0008 for 7 and 7 degrees of freedom, production line 1 is not producing as consistently 
as production 2. 

10.78 The hypotheses are 


Ho '■ <7 1 — cr 2 , 

Hi : ui ^ a 2 . 

Computation: si = 291.0667 and s 2 = 119.3946, / = '3945)2 = 5.54. Since 

P-value = 2 P(/ > 5.54) = (2) (0.0002) = 0.0004 for 19 and 19 degrees of freedom, 
hydrocarbon emissions are more consistent in the 1990 model cars. 

10.79 The hypotheses are 


H 0 : die is balanced, 

H i : die is unbalanced. 


a = 0 . 01 . 

Critical region: y 2 > 15.086 with 5 degrees of freedom. 
Computation: Since e* = 30, for i — 1,2, ... ,6, then 


x 2 = (28 - 30) 2 + (36 - 30) 2 + _ + (23 - 30) 2 = ^ 


30 


30 


30 


Decision: Fail to reject H 0 ; the die is balanced. 
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10.80 The hypotheses are 


H 0 : coin is balanced, 

H\ : coin is not balanced. 


a = 0.05. 

Critical region: y 2 > 3.841 with 1 degrees of freedom. 
Computation: Since e* = 30, for i — 1, 2, . . . , 6, then 


X 


2 


(63 - 50) 2 (37- 50) 2 

50 + 50 


6.76. 


Decision: Reject Ho] the coin is not balanced. 


10.81 The hypotheses are 


H 0 : nuts are mixed in the ratio 5:2:2: 1, 

Hi : nuts are not mixed in the ratio 5:2:2: 1. 


a = 0.05. 

Critical region: y 2 > 7.815 with 3 degrees of freedom. 
Computation: 


Observed 

269 

112 

74 

45 

Expected 

250 

100 

100 

50 


X 


2 


(269 - 250) 2 (112 - 100) 2 (74 - 100) 2 (45 - 50) 2 

250 + 100 + 100 + 50 


10.14. 


Decision: Reject Ho] the nuts are not mixed in the ratio 5:2:2:1. 


10.82 The hypotheses are 

Ho : Distribution of grades is uniform, 

H i : Distribution of grades is not uniform. 


a = 0.05. 

Critical region: y 2 > 9.488 with 4 degrees of freedom. 
Computation: Since e* = 20, for i = 1, 2, . . . , 5, then 


X 


2 


(14-20) 2 | (18-20) 2 | (16-20) 2 

20 + 20 + 20 


10 . 0 . 


Decision: Reject H 0 ; the distribution of grades is not uniform. 
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10.83 The hypotheses are 

H 0 : Data follows the binomial distribution b(y; 3, 1/4), 
H i : Data does not follows the binomial distribution. 


a = 0 . 01 . 

Computation: 6(0; 3, 1/4) = 27/64, 6(1; 3, 1/4) = 27/64, 6(2; 3, 1/4) = 9/64, and 
6(3; 3, 1/4) = 1/64. Hence e 4 = 27, e 2 = 27, e 3 = 9 and e 4 = 1. Combining the 
last two classes together, we obtain 


X 


2 


(21 -27) 2 (31 - 27) 2 (12 - 10) 2 

27 + 27 + 10 


2.33. 


Critical region: y 2 > 9.210 with 2 degrees of freedom. 

Decision: Fail to reject Hy the data is from a distribution not significantly different 
from b(y ; 3, 1/4). 


10.84 The hypotheses are 


H 0 : Data follows the hypergeometric distribution h{x\ 8,3,5), 
Hi : Data does not follows the hypergeometric distribution. 


a = 0.05. 

Computation: 6,(0; 8, 3, 5) = 1/56, 6(1; 8, 3, 5) = 15/56, 6(2; 8, 3, 5) = 30/56, and 
6(3; 8, 3, 5) = 10/56. Hence e 4 = 2, e 2 = 30, e 3 = 60 and e 4 = 20. Combining 
the first two classes together, we obtain 


X 


2 


(32 - 32) 2 | (55 - 60) 2 | (25 - 20) 2 
32 + 60 + 20 


1.67. 


Critical region: y 2 > 5.991 with 2 degrees of freedom. 

Decision: Fail to reject Hy the data is from a distribution not significantly different 
from h(y; 8, 3, 5). 


10.85 The hypotheses are 


Ho ■ f(x) = g(x; 1/2) for x = 1, 2, . . . , 
Hi:f(x)^g(: r; 1/2). 


a = 0.05. 

Computation: g(x ; 1/2) = T., for x — 1, 2, . . . , 7 and P( X > 8) = Hence e 4 = 128, 
e 2 = 64, e 3 = 32, e 4 = 16, = 8, eo = 4, ej = 2 and e 3 = 2. Combining the last three 

classes together, we obtain 


2 (136 - 128) 2 (60 - 64) 2 (34 - 32) 2 (12 - 16) 2 (9 - 8) 2 (5 - 8) 2 

X “ 128 + 64 + 32 + 16 + 8 + 8 

Critical region: y 2 > 11.070 with 5 degrees of freedom. 

Decision: Fail to reject Hy f(x) = g(x ; 1/2), for x = 1,2,... 


3.125 
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10.88 The hypotheses are 

H 0 : Distribution of grades is normal n(x; 65, 21), 
H i : Distribution of grades is not normal. 

a = 0.05. 

Computation: 



A goodness-of-fit test with 6 degrees of freedom is based on the following data: 


Oi 8 

4 

5 

11 

14 

14 

4 

ej 6.8 

7.0 

10.1 

11.1 

10.3 

7.4 

7.3 


Critical region: y 2 > 12.592. 


2 = (S-6.8) 2 (4 - 7.0) 2 (4 — 7.3) 2 

' 6.8 7.0 7.3 


Decision: Reject Hq; distribution of grades is not normal. 


10.89 From the data we have 




140 


Chapter 10 One- and Two-Sample Tests of Hypotheses 


The hypotheses are 

Ho : Distribution of nicotine contents is normal n(x; 1.8, 0.4), 
H i : Distribution of nicotine contents is not normal. 


a = 0 . 01 . 

Computation: A goodness-of-fit test with 5 degrees of freedom is based on the following 
data: 


Oi 

5 

4 

13 

8 

5 

5 

e t 

6.1 

6.0 

7.6 

7.7 

6.0 

6.4 


Critical region: y 2 > 15.086. 


2 (5-6.1) 2 (4-6.0) 2 (5 — 6.4) 2 in 

X = - — „ - + - — „ - H 1- - — „ , ’ = 5.19. 


6.1 


6.0 


6.4 


Decision: Fail to reject Hq ; distribution of nicotine contents is not significantly different 
from n(x] 1.8, 0.4). 


10.90 The hypotheses are 


Hq : Presence or absence of hypertension is independent of smoking habits, 

H\ : Presence or absence of hypertension is not independent of smoking habits. 


a = 0.05. 

Critical region: y 2 > 5.991 with 2 degrees of freedom. 
Computation: 


Observed and expected frequencies 


Nonsmokers 

Moderate Smokers 

Heavy Smokers 

Total 

Hypertension 

21 (33.4) 

36 (30.0) 

30 (23.6) 

87 

No Hypertension 

48 (35.6) 

26 (32.0) 

19 (25.4) 

93 

Total 

69 

62 

49 

180 


^2 = (21 - 33.4) 2 (19- 25.4)" 


33.4 


25.4 


14.60. 


Decision: Reject Hq ; presence or absence of hypertension and smoking habits are not 
independent. 


10.91 The hypotheses are 

Hq : A person’s gender and time spent watching television are independent, 

Hi : A person’s gender and time spent watching television are not independent. 

CL = 0.01. 

Critical region: y 2 > 6.635 with 1 degrees of freedom. 

Computation: 
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Observed and expected frequencies 


Male 

Female 

Total 

Over 25 hours 

15 (20.5) 

29 (23.5) 

44 

Under 25 hours 

27 (21.5) 

19 (24.5) 

46 

Total 

42 

48 

90 


X 


2 


(15 — 20. 5) 2 (29 — 23. 5) 2 (27 - 21.5) 2 (19 - 24.5) 2 

205 + 205 + 2L5 + 245 


5.47. 


Decision: Fail to reject H 0 ; a person’s gender and time spent watching television are 
independent. 


10.92 The hypotheses are 

Ho : Size of family is independent of level of education of father, 

H i : Size of family and the education level of father are not independent. 


a = 0.05. 

Critical region: y 2 > 9.488 with 4 degrees of freedom. 
Computation: 


Observed and expected frequencies 

Education 

Number of Children 

Total 

0-1 

2-3 

Over 3 

Elementary 

14 (18.7) 

37 (39.8) 

32 (24.5) 

83 

Secondary 

19 (17.6) 

42 (37.4) 

17 (23.0) 

78 

College 

12 (8.7) 

17 (18.8) 

10 (11.5) 

39 

Total 

45 

96 

59 



2 _ (14 - 18.7) 2 + (37 - 39.8)" 


+ ••• + 


(10- 11.5)" 


= 7.54. 


18.7 ' 39.8 ' ' 11.5 

Decision: Fail to reject Hq; size of family is independent of level of education of father. 


10.93 The hypotheses are 

H 0 : Occurrence of types of crime is independent of city district, 
Hi : Occurrence of types of crime is dependent upon city district. 

a = 0 . 01 . 

Critical region: y 2 > 21.666 with 9 degrees of freedom. 

Computation: 


Observed and expected frequencies 

District 

Assault 

Burglary 

Larceny 

Homicide 

Total 

1 

162 (186.4) 

118 (125.8) 

451 (423.5) 

18 (13.3) 

749 

2 

310 (380.0) 

196 (256.6) 

996 (863.4) 

25 (27.1) 

1527 

3 

258 (228.7) 

193 (154.4) 

458 (519.6) 

10 (16.3) 

919 

4 

280 (214.9) 

175 (145.2) 

390 (488.5) 

19 (15.3) 

864 

Total 

1010 

682 

2295 

72 

4059 
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2 162 - 186.4 2 118 - 125.8 2 19 - 15.3 2 

v 2 = - — + 4 4 |- 2 — = 124.59. 

A 186.4 125.8 15.3 

Decision: Reject Hy occurrence of types of crime is dependent upon city district. 


10.94 The hypotheses are 


H 0 : The three cough remedies are equally effective, 

H i : The three cough remedies are not equally effective. 


a = 0.05. 

Critical region: y 2 > 9.488 with 4 degrees of freedom. 
Computation: 


Observed and expected frequencies 


NyQuil 

Robitussin 

Triaminic 

Total 

No Relief 

11 (11) 

13 (11) 

9 (11) 

33 

Some Relief 

32 (29) 

28 (29) 

27 (29) 

87 

Total Relief 

7(10) 

9 (10) 

14 (10) 

30 

Total 

50 

50 

50 

150 


_ (11 -ll) 2 (13 -ll) 2 (14-10) = „ 

11 11 10 


Decision: Fail to reject Hy the three cough remedies are equally effective. 
10.95 The hypotheses are 

H 0 : The attitudes among the four counties are homogeneous, 

Hi : The attitudes among the four counties are not homogeneous. 

Computation: 


Observed and expected frequencies 

Attitude 

County 

Total 

Craig 

Giles 

Franklin 

Montgomery 

Favor 

65 (74.5) 

66 (55.9) 

40 (37.3) 

34 (37.3) 

205 

Oppose 

42 (53.5) 

30 (40.1) 

33 (26.7) 

42 (26.7) 

147 

No Opinion 

93 (72.0) 

54 (54.0) 

27 (36.0) 

24 (36.0) 

198 

Total 

200 

150 

100 

100 

550 


x 2 = 


(65 - 74. 5) 2 (66 - 55.9) 


+ 


+ ••• + 


(24 — 36.0)" 


= 31.17. 


74.5 ' 55.9 ' ' 36.0 

Since P - value = P(y 2 > 31.17) < 0.001 with 6 degrees of freedom, we reject H 0 and 
conclude that the attitudes among the four counties are not homogeneous. 
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10.96 The hypotheses are 

Hq : The proportions of widows and widowers are equal with respect to the different 
time period, 

H i : The proportions of widows and widowers are not equal with respect to the 
different time period. 


a = 0.05. 

Critical region: y 2 > 5.991 with 2 degrees of freedom. 
Computation: 


Observed and expected frequencies 

Years Lived 

Widow 

Widower 

Total 

Less than 5 

25 (32) 

39 (32) 

64 

5 to 10 

42 (41) 

40 (41) 

82 

More than 10 

33 (26) 

21 (26) 

54 

Total 

100 

100 

200 


2 (25 - 32) 2 (39 - 32) 2 (21 - 26) 2 

X = _ + _ H h — y = 5.78. 


32 


32 


26 


Decision: Fail to reject Hq] the proportions of widows and widowers are equal with 
respect to the different time period. 


10.97 The hypotheses are 


Hq : Proportions of household within each standard of living category are equal, 

Hi : Proportions of household within each standard of living category are not equal. 


a = 0.05. 

Critical region: y 2 > 12.592 with 6 degrees of freedom. 
Computation: 


Observed and expected frequencies 

Period 

Somewhat Better 

Same 

Not as Good 

Total 

1980: Jan. 

72 (66.6) 

144 (145.2) 

84 (88.2) 

300 

May. 

63 (66.6) 

135 (145.2) 

102 (88.2) 

300 

Sept. 

47 (44.4) 

100 (96.8) 

53 (58.8) 

200 

1981: Jan. 

40 (44.4) 

105 (96.8) 

55 (58.8) 

200 

Total 

222 

484 

294 

1000 




66. 6) 2 (144 


145.2) 2 


66.6 


145.2 


55 -58.8 2 

+ • • • + ^ = 5.92. 

58.8 


Decision: Fail to reject Hq ; proportions of household within each standard of living 
category are equal. 
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10.98 The hypotheses are 

Ho : Proportions of voters within each attitude category are the same for each of the 
three states, 

H i : Proportions of voters within each attitude category are not the same for each of 
the three states. 

a = 0.05. 

Critical region: y 2 > 9.488 with 4 degrees of freedom. 

Computation: 


Observed and expected frequencies 


Support 

Do not Support 

Undecided 

Total 

Indiana 

82 (94) 

97 (79) 

21 (27) 

200 

Kentucky 

107 (94) 

66 (79) 

27 (27) 

200 

Ohio 

93 (94) 

74 (79) 

33 (27) 

200 

Total 

282 

237 

81 

600 


..2 (82 ^ 94) 2 | (97 - 79) 2 | | (33 - 27) 2 ^ 


94 


79 


27 


Decision: Reject H 0 ] the proportions of voters within each attitude category are not 
the same for each of the three states. 


10.99 The hypotheses are 

Ho : Proportions of voters favoring candidate A, candidate B, or undecided are the 
same for each city, 

Hi : Proportions of voters favoring candidate A, candidate B , or undecided are not 
the same for each city. 

a = 0.05. 

Critical region: y 2 > 5.991 with 2 degrees of freedom. 

Computation: 


Observed and expected frequencies 


Richmond 

Norfolk 

Total 

Favor A 

204 (214.5) 

225 (214.5) 

429 

Favor B 

211 (204.5) 

198 (204.5) 

409 

Undecided 

85 (81) 

77 (81) 

162 

Total 

500 

500 

1000 
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2 (204 

X 


214.5) 2 (225 — 214.5)" 


214.5 


+ 


214.5 


+ ••• + 


(77-81) 2 

81 


= 1.84. 


Decision: Fail to reject H 0 ] the proportions of voters favoring candidate A, candidate 
B, or undecided are not the same for each city. 


10.100 The hypotheses are 


H 0 ■ Pi = P 2 = Pa, 

Hi : p i , p 2 , and p^ are not all equal. 


a = 0.05. 

Critical region: y 2 > 5.991 with 2 degrees of freedom. 
Computation: 


Observed and expected frequencies 


Denver 

Phoenix 

Rochester 

Total 

Watch Soap Operas 

52 (48) 

31 (36) 

37 (36) 

120 

Do not Watch 

148 (152) 

119 (114) 

113 (114) 

380 

Total 

200 

150 

150 

500 


x 2 = (52 - 48) 2 + (31 -36) 2 + + (113 - 114) 2 


48 


36 


114 


Decision: Fail to reject H 0 ', no difference among the proportions. 


1.39. 


10.101 The hypotheses are 


H 0 : pi = p 2 , 

Hi : pi > p 2 . 


a = 0.01. 

Critical region: z > 2.33. 

Computation: p\ = 0.31, p 2 = 0.24, p = 0.275, and 

0.31-0.24 

z = - = 1.11. 

a/ (0.275) (0.725) (1/100 + 1/100) 

Decision: Fail to reject H 0 ; proportions are the same. 

10.102 Using paired t- test, we observe that t = 1.55 with P-value > 0.05. Hence, the data 
was not sufficient to show that the oxygen consumptions was higher when there was 
little or not CO. 

10.103 (a) H 0 : p — 21.8, T/j : /i ^ 21.8; critical region in both tails. 

(b) H 0 : p = 0.2, H\ \ p > 0.2; critical region in right tail. 
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(c) Hq : p — 6.2, Hi : p > 6.2; critical region in right tail. 

(d) H 0 : p = 0.7, Hi : p < 0.7; critical region in left tail. 

(e) Hq : p = 0.58, Hi : p ^ 0.58; critical region in both tails. 

(f) H 0 : p = 340, H\ : p < 340; critical region in left tail. 

10.104 The hypotheses are 


Hq : Pi = p 2 , 
Hi : pi > p 2 . 


a = 0.05. 

Critical region: z > 1.645. 

Computation: pi = 0.24, p 2 = 0.175, p = 0.203, and 


0.24-0.175 

\J (0.203) (0.797) (1/300 + 1/400) 


2 . 12 . 


Decision: Reject Hq\ there is statistical evidence to conclude that more Italians prefer 
white champagne at weddings. 


10.105 ri\ = n 2 = 5, x\ = 165.0, Si = 6.442, x 2 = 139.8, s 2 = 12.617, and s p = 10.02. Hence 


165 - 139.8 
(10.02)^1/5 + 1/5 


3.98. 


This is a one-sided test. Therefore, 0.0025 < P-value < 0.005 with 8 degrees of 
freedom. Reject H 0 ] the speed is increased by using the facilitation tools. 


10.106 (a) Hq : p = 0.2, H\ \ p > 0.2; critical region in right tail. 

(b) Hq : p = 3, Hi : p ^ 3; critical region in both tails. 

(c) Hq : p = 0.15, Hi : p < 0.15; critical region in left tail. 

(d) Hq : p = $10, H\ : p > $10; critical region in right tail. 

(e) Hq : p = 9, Hi : p ^ 9; critical region in both tails. 

10.107 The hypotheses are 


Hq : pi = p 2 = p 3 , 

Hi : p i , /+ , and p 3 are not all equal. 


a = 0 . 01 . 

Critical region: y 2 > 9.210 with 2 degrees of freedom. 
Computation: 
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Observed and expected frequencies 


Distributor 

Total 

1 

2 

3 

Peanuts 

Other 

345 (339) 
155 (161) 

313 (339) 
187 (161) 

359 (339) 
141 (161) 

1017 

483 

Total 

500 

500 

500 

1500 


2 = (345 - 339) 2 (313 - 339) 2 (141 - 161) 2 = 

X 339 339 161 

Decision: Reject H 0 ; the proportions of peanuts for the three distributors are not equal. 


10.108 The hypotheses are 


H 0 ■ Pi ~ p-2 = 0.03, 
H\ : p\ - p 2 > 0.03. 


Computation: p\ = 0.60 and p 2 = 0.48. 

(0.60- 0.48) - 0.03 

* = . V ; = 2.18. 

7(0.60) (0.40) /200 + (0.48) (0.52) /500 

P - value = P(Z > 2.18) = 0.0146. 

Decision: Reject H 0 at level higher than 0.0146; the difference in votes favoring the 
proposal exceeds 3%. 

10.109 The hypotheses are 


H 0 : Pi = P2 = P3 = P4, 

Hi : p i,p 2 , Ps , and p 4 are not all equal. 


a = 0 . 01 . 

Critical region: y 2 > 11.345 with 3 degrees of freedom. 
Computation: 


Observed and expected frequencies 

Preference 

Maryland 

Virginia 

Georgia 

Alabama 

Total 

Yes 

65 (74) 

71 (74) 

78 (74) 

82 (74) 

296 

No 


29 (26) 

22 (26) 

18 (26) 

104 

Total 

100 

100 

100 

100 

400 


2 _ (65 - 74) 2 (71— 74) 2 (18 - 26)* 

74 74 26 ' ' 

Decision: Fail to reject Ho; the proportions of parents favoring Bibles in elementary 
schools are the same across states. 
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10.110 d = -2.905, s d = 3.3557, and t = = -2.12. Since 0.025 < P(T > 2.12) < 0.05 

with 5 degrees of freedom, we have 0.05 < P-value < 0.10. There is no significant 
change in WBC leukograms. 

10.111 ri\ = 15, X\ = 156.33, si = 33.09, n 2 = 18, x 2 = 170.00 and s 2 = 30.79. First we do the 

S 2 

/-test to test equality of the variances. Since f — ~k — 1.16 and /o.o5(15, 18) = 2.27, 
we conclude that the two variances are equal. 

To test the difference of the means, we first calculate s p = 31.85. Therefore, t = 
i56.33-i70.oo = _ 1-2 3 with a P-value > 0.10. 

(31.85)^/ 1/15+1/18 

Decision: H 0 cannot be rejected at 0.05 level of significance. 

10.112 n i = n 2 = 10, X\ = 7.95, Si = 1.10, x 2 = 10.26 and s 2 = 0.57. First we do the /-test to 

S 2 

test equality of the variances. Since f — ~k — 3.72 and /o.o5(9, 9) = 3.18, we conclude 
that the two variances are not equal at level 0.10. 

To test the difference of the means, we first find the degrees of freedom v = 13 when 
round up. Also, t = '■ 95-10.26 _ _5 gp with a P-value < 0.0005. 

y+.10 2 /10+0.57 2 /10 

Decision: Reject H 0 ] there is a significant difference in the steel rods. 

10.113 m = n 2 = 10, x x = 21.5, si = 5.3177, x 2 = 28.3 and s 2 = 5.8699. Since / = 4 = 

s 2 

0.8207 and /o.os(9, 9) = 3.18, we conclude that the two variances are equal. 

s p = 5.6001 and hence t = 2L5 7 28 ' 3 = —2.71 with 0.005 < P-value < 0.0075. 

p (5.6001)^1/10+1/10 

Decision: Reject Ho ; the high income neighborhood produces significantly more wastew- 
ater to be treated. 

10.114 m = n 2 = 16, x\ = 48.1875, .s, = 4.9962, x 2 = 43.7500 and s 2 = 4.6833. Since 

2 

f — ^ — 1.1381 and /o.o5(15, 15) = 2.40, we conclude that the two variances are equal. 
s„ = 4.8423 and hence t = 48 ' 1875 7 43 -' 50 o = 2.59. This is a two-sided test. Since 

p (4.8423)^/1/16+1/16 

0.005 < P(T > 2.59) < 0.0075, we have 0.01 < P-value < 0.015. 

Decision: Reject Ho ; there is a significant difference in the number of defects. 

10.115 The hypotheses are: 

H 0 : p = 24 x 10” 4 gm, 

Hi : p < 24 x 10 -4 gm. 

t = 22 8 8 /Hi = —1.77 with 0.025 < P-value < 0.05. Hence, at significance level of 
a = 0.05, the mean concentration of PCB in malignant breast tissue is less than 
24 x 10 -4 gm. 



Chapter 11 

Simple Linear Regression and 
Correlation 


11.1 (a) = 778.7, = 2050.0, J2 X 1 = 26,591.63, E x t y t = 65,164.04, n = 25. 

iii i 

Therefore, 

= (25)(65, 164.04) - (778.7)(2050.0) = 

(25)(26, 591.63) - (778.7) 2 

2050 - (0.5609)(778.7) 
a = — = 64.53. 


(b) Using the equation y = 64.53 + 0.5609a; with x = 30, we find y = 64.53 + 
(0.5609)(30) = 81.40. 

(c) Residuals appear to be random as desired. 



10 20 30 40 50 60 

Arm Strength 


E Xi = 707, E Vi = 658, E x l = 57, 557, E x iVi = 53, 258, n = 9. 


= (9) (53, 258) — (707) (658) _ 

(9)(57, 557) — (707) 2 ’ 


658 - (0.7771) (707) 


= 12.0623. 
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Hence y = 12.0623 + 0.7771a:. 

(b) For x = 85, y = 12.0623 + (0.7771) (85) = 78. 

11.3 (a) '£x i = 16.5, E y t = 100.4, E x- = 25.85, E x lVl = 152.59, n = 11. 

iii i 


b = 


a = 


(11)(152.59) - (16.5)(100.4) _ 1 
(11)(25.85) - (16.5) 2 
100.4 - (1.8091)(16.5) = g 413g 


Hence y = 6.4136 + 1.8091a: 

(b) For x = 1.75, y = 6.4136 + (1.8091)(1.75) = 9.580. 

(c) Residuals appear to be random as desired. 



1.0 1.2 1.4 1.6 1.8 2.0 

Temperature 


11.4 (a) E Xi = 311.6, E Vi = 297.2, £ x 2 = 8134.26, E x iVi = 7687.76, n ■- 

iii i 

b = (12)(7687.26) — (311.6) (297.2) 2 _ Q . 

= 297.2 - (-0.68Fl)(311.6) = 

12 

Hence y = 42.582 — 0.6861a:. 

(b) At x = 24.5, y = 42.582 - (0.6861)(24.5) = 25.772. 

11.5 (a) E*» = 675, E Vi = 488, E A = 37, 125, E x dk = 25, 005, n = 18. 

iii i 


b = 


a = 


(18) (25, 005) - (675) (488) 

(18) (37, 125) - (675) 2 “ ' 

488 - (0.5676)(675) = g ^ 
18 


Therefore, 


12 . 


Therefore, 


Hence y = 5.8254 + 0.5676x 
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(b) The scatter plot and the regression line are shown below. 



Temperature 


(c) For x = 50, y = 5.8254 + (0.5676) (50) = 34.205 grams. 
11.6 (a) The scatter plot and the regression line are shown below. 



Placement Test 

(b) Yl x i = IHO, YlVi = H73, = 07, 100, Yl x iUi = 67,690, n = 20. Therefore, 

iii i 

(20)(67,690)~ (1110)(1173) 

(20) (67, 100) — (1110) 2 
1173-(0.4711)(1110) 

20 

Hence y = 32.5059 + 0.4711a; 

(c) See part (a). 

(d) For y = 60, we solve 60 = 32.5059 + 0.4711a; to obtain x = 58.466. Therefore, 
students scoring below 59 should be denied admission. 


11.7 (a) The scatter plot and the regression line are shown here. 
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Advertising Costs 


(b) Yl x i = 410, = 5445, Y X 1 = 15,650, Y x iUi = 191,325, n = 12. Therefore, 

iii i 


b (12)(191, 325) — (410) (5445) 
(12) (15, 650) - (410) 2 


3.2208, 


= 5445 -(3.22081(410) = 
12 


Hence y = 343.7056 + 3.2208a; 

(c) When x = $35, y = 343.7056 + (3.2208) (35) = $456.43. 

(d) Residuals appear to be random as desired. 



20 25 30 35 40 45 50 

Advertising Costs 


11.8 (a) y — —1.70 + 1.81a;. 

(b) x = (54 + 1.71) /1.81 = 30.78. 

11.9 (a) E *i = 45, E Vi = 1094, E X 1 = 244.26, E X M = 5348.2, n = 9. 

iii i 


(9) (5348.2) - (45) (1094) 


= 153.1755. 


Hence y = 153.1755 - 6.3240a;. 





Solutions for Exercises in Chapter 11 


153 


(b) For x = 4.8, y = 153.1755 - (6.3240)(4.8) = 123. 

11.10 (a) 5 = cd w , Inf = Inc + (In d)w, setting y — In a — Inc, b = In d, and y = a + bx, 

we have 


X = w 

1 

2 

2 

3 

5 

5 

y — In z 

8.7562 

8.6473 

8.6570 

8.5932 

8.5142 

8.4960 


£24 = 18 ? YlVi = 51.6639, Y, x i = 68, £ xyiy = 154.1954, n = 6. 


b = In d 


a = In c 


(6)(154.1954) - (18)(51.6639) 
(6)(68) — (18) 2 
51.6639- (-0.0569)(18) 
6 = 8 ‘ 


■ = -0.0569 
7813. 


5 


Now c = e 8 ' 7813 = 6511.3364, d = e~ om 69 = 0.9447, and z = 6511.3364 x 0.9447™ 
(b) For w = 4, z = 6511.3364 x 0.9447 4 = $5186.16. 


11.11 (a) The scatter plot and the regression line are shown here. 



(b) £ = 14, 292, £ y t = 35, 578, £ x? = 22, 954, 054, £ x iVi = 57, 441, 610, n = 9. 

iii i 

Therefore, 


b = 


a = 


(9) (57, 441, 610) - (14, 292)(35, 578) 
(9) (22, 954, 054) - (14,292) 2 
35,578- (3.6529)(14, 292) 


= 3.6529, 


9 


= -1847.69. 


Hence y = —1847.69 + 3.6529a:. 


11.12 (a) The scatter plot and the regression line are shown here. 
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(b) Y, x i = 401 > E Vi = 2301, J2 x i 

i i i 


22,495, £ 

i 


118,652, n 


Therefore, 


b = 


a = 


(8) (118, 652) - (401)(2301)) 

(8)(22, 495) - (401) 2 

2301 - (1.3839)(401) 

= 218. 

8 


1.3839, 


Hence y = 218.26 + 1.3839a;. 

(c) For x = 65°F, y = 218.26 + (1.3839)(65) = 308.21. 

11.13 (a) The scatter plot and the regression line are shown here. A simple linear model 

seems suitable for the data. 



(b) Yl x i = 999, EUi = 670, x'f — 119,969, ^ x l y l = 74,058, n = 10. Therefore, 

iii i 


_ (10) (74, 058) - (999) (670) _ 
“ (10) (119, 969) - (999) 2 “ 

670 — (0.3533)(999) Qi ^ 


Hence y = 31.71 + 0.3533a;. 
(c) See (a). 
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11.14 From the data summary, we obtain 

= (12)(318) — [(4) (12)] [(12) (12)] = 

(12)(232) — [(4)(12)] 2 ‘ ’ 

a — 12 — (—6.45) (4) = 37.8. 

Hence, y = 37.8 — 6.45a:. It appears that attending professional meetings would not 
result in publishing more papers. 

11.15 The least squares estimator A of a is a linear combination of normally distributed 
random variables and is thus normal as well. 


E(A) = E(Y - Bx) = E(Y) - xE(B) = a + f3x - (3x = a, 

9 99 <T~ X 2 (J 2 

cr\ — a V-Bx = G \ + - ^ xg yb = 1- — , since a YB = 0. 

U E(^-*) 2 


Hence 




E- x ' 

2=1 


■o' . 


n J2( x i - X ) 2 
2=1 


11.16 We have the following: 


-i n 1 n 

Cov(Y,B)=E{{-Y J Y i --J2^, 


2=1 


n 


2=1 


Y( x i ~ x ) Y i J2( x i ~ x )hYi 

2=1 2=1 

n n 

E^-s ) 2 Y( x i — x ) 2 


E( x * ~ x )E(Yi - y Yi ) 2 + Y(. x i- x )E(Yi - p Yi )(Yj - p Yj ) 

i=l i^j 

n 

n Y( x i ~ x ) 2 
2=1 
n 

E ( x * - x ) G Yi + E Cov(Yi , Yj) 
i= 1 i^j 

n 

n E ( x i - x ) 2 
2=1 


Now, a Y . = a 2 for all i, and Cov(Yi, Yj) = 0 for i ^ j. Therefore, 


c 2 ~ x ) 

Cov(Y, B) = — ^ 

n E i x i ~ x ) 2 


0 . 
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11.17 S xx = 26,591.63 - 778.7 2 /25 = 2336.6824, S yy = 172,891.46 - 2050 2 /25 = 4791.46, 
S xy = 65, 164.04 - (778.7) (2050)/25 = 1310.64, and b = 0.5609. 

(a) S 2 = ™.46-i0.MO9)(1310.64) = m 353 

(b) The hypotheses are 

H 0 : (3 = 0, 

Hu 0. 

a = 0.05. 

Critical region: t < —2.069 or t > 2.069. 

Computation: t = °- 5609 = 2.04. 

176.362/2336.6824 

Decision: Do not reject H 0 . 

11.18 S xx = 57,557 - 707 2 /9 = 2018.2222, S yy = 51,980 - 658 2 /9 = 3872.8889, S xy = 
53,258 - (707)(658)/9 = 1568.4444, a = 12.0623 and b = 0.7771. 


(a) S 2 = j»” i«8»-<0.7T71)(156 8 .4H4) = 379 jgg 

(b) Since s = 19.472 and to .025 = 2.365 for 7 degrees of freedom, then a 95% confidence 
interval is 


12.0623 ± (2.365) 


(379.150)(57, 557) 
(9X2018^222) 


12.0623 ± 81.975, 


which implies —69.91 < a < 94.04. 

(c) 0.7771 ± (2.365) implies -0.25 < (5 < 1.80. 

11.19 S xx = 25.85 - 16.5 2 / 11 = 1.1, S yy = 923.58 - 100.4 2 /11 = 7.2018, S xy = 152.59 - 
(165) (100.4) /II = 1.99, a = 6.4136 and b = 1.8091. 

(a) S 2 = 7.2018-(1.8091)(1.99) q 4 q 

(b) Since s = 0.632 and £ 0.025 = 2.262 for 9 degrees of freedom, then a 95% confidence 
interval is 


6.4136 ± (2.262)(0.632) 


25.85 

(n)(i.i) 


6.4136 ±2.0895, 


which implies 4.324 < a < 8.503. 

(c) 1.8091 ± (2.262) (0.632) / \/TT implies 0.446 < (5 < 3.172. 

11.20 S xx = 8134.26 - 311.6 2 /12 = 43.0467, S yy = 7407.80 - 297.2 2 /12 = 47.1467, S xy = 
7687.76 - (311.6)(297.2)/12 = -29.5333, a = 42.5818 and b = -0.6861. 
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(b) Since s = 1.640 and £ 0.005 = 3.169 for 10 degrees of freedom, then a 99% confidence 
interval is 


42.5818 ± (3.169)(1. 640) 


8134.26 
(12) (43.0467) 


42.5818 ±20.6236, 


which implies 21.958 < a < 63.205. 

(c) -0.6861 ± (3.169)(1.640)/V43.0467 implies -1.478 < (3 < 0.106. 

11.21 S xx = 37,125 - 675 2 / 18 = 11,812.5, S yy = 17,142 - 488 2 /18 = 3911.7778, S xy = 
25,005 - (675)(488)/18 = 6705, a = 5.8254 and b = 0.5676. 

(a) s 2 = ^911' 7778— (tC 5676) (6705) = Q ^ 

(b) Since s = 2.574 and £ 0.005 = 2.921 for 16 degrees of freedom, then a 99% confidence 
interval is 


5.8261 ±(2.921)(2.574) 


37, 125 


(18)(11, 812.5) 


5.8261 ±3.1417, 


which implies 2.686 < a < 8.968. 

(c) 0.5676 ± (2.921)(2.574)/ v /ll, 812.5 implies 0.498 < (5 < 0.637. 


11.22 The hypotheses are 


H 0 : a = 10, 

Hi : a > 10 . 


a = 0.05. 

Critical region: £ > 1.734. 

Computations: S xx = 67, 100 - 1110 2 /20 = 5495, S yy = 74, 725 - 1173 2 /20 = 5928.55, 
S xy = 67,690 - (1110)(1173)/20 = 2588.5, s 2 = 5928 - 55 -(° i 4 8 711 )(2588.5) = 2 61.617 and 
then s = 16.175. Now 

32.51 - 10 

£ = , = 1.78. 

16.175a/67, 100/(20)(5495) 

Decision: Reject Ho and claim a > 10. 

11.23 The hypotheses are 

H 0 : /3 = 6, 

H 1 : 0 < 6. 


a = 0.025. 

Critical region: £ = —2.228. 
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Computations: S xx = 15, 650 - 410 2 /12 = 1641.667, S yy = 2, 512.925 - 5445 2 /12 = 
42,256.25, S xy = 191,325 - (410)(5445)/12 = 5,287.5, s 2 = 42 > 256 - 25 U3,22i)( 5 , 287.5) = 
2, 522.521 and then s = 50.225. Now 


50.225/V1641.667 

Decision: Reject H 0 and claim (3 < 6. 

11.24 Using the value s = 19.472 from Exercise 11.18(a) and the fact that f/o = 74.230 when 
Xq = 80, and x = 78.556, we have 

fl 1.444 2 

74.230 ± 2.365) 19.472) \ - + = 74.230 ± 15.4216. 

v A ; V 9 2018.222 

Simplifying it we get 58.808 < /iy | 80 < 89.652. 

11.25 Using the value s = 1.64 from Exercise 11.20(a) and the fact that yo = 25.7724 when 
xq = 24.5, and x = 25.9667, we have 

(a) 25.7724 ± (2.228) (1.640) y/ ± = 25.7724 ± 1.3341 implies 24.438 < 

Ly | 24.5 < 27.106. 

(b) 25.7724 ± (2.228) (1.640)^1 + ^ + = 25.7724 ± 3.8898 implies 21.883 < 

yo < 29.662. 


11.26 95% confidence bands are obtained by plotting the limits 


| (nr 2 

(6.4136 + 1.809a) ± (2.262)(0.632)^/ — + 2 — 



Temperature 


11.27 Using the value s = 0.632 from Exercise 11.19(a) and the fact that yo 
x 0 = 1.6, and x = 1.5, we have 


9.308 when 


9.308 ± (2.262)(0.632) 



0.1 2 

TT 


9.308 ± 1.4994 


implies 7.809 < y 0 < 10.807. 
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11.28 sing the value s = 2.574 from Exercise 11.21(a) and the fact that yo = 34.205 when 
Xq = 50, and x = 37.5, we have 

(a) 34.205 ± (2.921)(2.574) = 34.205 ± 1.9719 implies 32.23 < p Y | 50 < 

36.18. 

(b) 34.205 ± (2.921) (2.574) yjl + ^ + = 34.205 ± 7.7729 implies 26.43 < y 0 < 

41.98. 

11.29 (a) 17.1812. 

(b) The goal of 30 mpg is unlikely based on the confidence interval for mean mpg, 
(27.95,29.60). 

(c) Based on the prediction interval, the Lexus ES300 should exceed 18 mpg. 

11.30 It is easy to see that 

n n n 

53 (^ - yi) = 53 ~ a ~ bx d = 53 ^ ~ (y ~ ~ bXi ) 

i=l i — 1 i= 1 

n n 

= 53 (y* ~ yd ~ b 53 = °> 

i = 1 i = 1 

since a — y — bx. 


11.31 When there are only two data points X\ ^ x 2 , using Exercise 11.30 we know that 
(yi — Vi) + (y 2 *- i/ 2 ) = 0. On the other hand, by the method of least squares on page 
395, we also know that x\{y\ — ij\) + x 2 (y 2 — # 2 ) = 0. Both of these equations yield 
(x 2 - x 1 )(y 2 ~ m) = 0 and hence y 2 - y 2 = 0. Therefore, y 2 - yi — 0. So, 

SSE = (j/i - i/ 1) 2 + ( 1/2 - i/ 2) 2 = 0. 

Since R 2 = 1 — |||?, we have i? 2 = 1. 

11.32 (a) Suppose that the fitted model is y = bx. Then 

n n 

sse = 53 (yi - Vi) 2 = 53 _ bx d 2 - 

i= 1 2=1 

Taking derivative of the above with respect to b and setting the derivative to zero, 

n 

Ti 53 

we have —2 )T) ay(y, — bxi ) = 0, which implies 6 = *=£ . 

*=1 E 

7=1 

/ n \ n 

Varljr, x i Y i) T, 2 

(b) a 2 n — 4F 1 — = ^ = -it — , since K’s are independent. 

(E x i) 2 (E* 2 ) 2 E a:? 
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n n 

(b) £ xj = 1629 and £ x iVi = 5564. Hence b = f§§ = 3.4156. So, y = 3.4156a;. 

i = 1 i— 1 

(c) See (a). 

(d) Since there is only one regression coefficient, /3, to be estimated, the degrees of 
freedom in estimating a 2 is n — 1. So, 


d 2 = 


SSE 

n — 1 


n 


E(2/< 


tei ) 2 


n — 1 


(e) V arijji) = Var(Bxi) = x 2 V 'ar(B) = 

E 

i=l 

(f) The plot is shown next. 



X 


11.34 Using part (e) of Exercise 11.33, we can see that the variance of a prediction y 0 at Xq 





Solutions for Exercises in Chapter 11 


161 


is o 


2 

yo 


= a 



Hence the 95% prediction limits are given as 


(3.4145)(25) ± (2.776)^11.16132 



25 2 

1629 


85.3625 ± 10.9092, 


which implies 74.45 < y 0 < 96.27. 


E XiVi 

11.35 (a) As shown in Exercise 11.32, the least squares estimator of (3 is b — ±1 . 

E A 

i= 1 

n n 

(b) Since Y x iUi — 197.59, and Y x i = 98.64, then b = g 97 ^ 59 = 2.003 and y = 2.003a;. 

i= 1 i= 1 

11.36 It can be calculated that b = 1.929 and a = 0.349 and hence y = 0.349 ± 1.929a; when 
intercept is in the model. To test the hypotheses 

H 0 : a = 0, 

Hi : a % 0 , 


with 0.10 level of significance, we have the critical regions as t < —2.132 or t > 2.132. 
Computations: s 2 = 0.0957 and t = , = 0,349 = 1.40. 

( 0 . 0957 )( 98 . 64 )/( 6 )( 25 . 14 ) 

Decision: Fail to reject H 0 ] the intercept appears to be zero. 

11.37 Now since the true model has been changed, 


E(B) 


£(*1—*! )E(Y.) 

i= 1 


n 


Yi x u 

i = 1 


a;i)(a + f3x u + 'jx 2i ) 


n 


Y( x l i - x i) 2 

i— 1 


n 


Y( x u ~ ^i) 2 

i= 1 


n n 

PY( X U - ^i) 2 + 7E(hi - Xi)x 2 i 

i — 1 i — 1 


n 


Y( x u ~ x i ) 2 
2=1 


Y( X li ~ x l) x 2i 

P + 7 ^ 

Yi x u - x i ) 2 


11.38 The hypotheses are 

H 0 : (3 = 0, 

Hi : (3 % 0. 

Level of significance: 0.05. 

Critical regions: / > 5.12. 

Computations: SSR = bS xy = E 8991 = 3.60 and SSE = S yy — SSR = 7.20 — 3.60 = 
3.60. 
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Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Regression 

3.60 

1 

3.60 

9.00 

Error 

3.60 

9 

0.40 


Total 

7.20 

10 




Decision: Reject Hq. 

11.39 (a) ,S'„ = 1058, S„ = 198.76, S, v = -363.63, /.-§=- -0.34370, and a = 

210 — ( — 0 . 34370 ) ( 172 . 5 ) = 10 . 81153 . 

(b) The hypotheses are 


Hq : The regression is linear in x, 

H i : The regression is nonlinear in x. 


a = 0.05. 

Critical regions: / > 3.10 with 3 and 20 degrees of freedom. 
Computations: SST = S yy = 198.76, SSR = bS xy = 124.98 and 
SSE = S yy - SSR = 73.98. Since 

Ti. = 51.1, T- 2 . = 51.5, T 3 . = 49.3, T 4 . = 37.0 and T 5 . = 22.1, 

then 

5 5 5 rj~t 2 

SSE( pure) = V V yj - V = 1979.60 - 1910.272 = 69.33. 

0 

i = 1 j = l 2—1 

Hence the “Lack-of-ht SS” is 73.78 — 69.33 = 4.45. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

124.98 

1 

124.98 


Error 

73.98 

23 

3.22 


f Lack of fit 

f 4.45 

f 3 

r i.48 

0.43 

| Pure error 

\ 69.33 

\ 20 

\ 3.47 


Total 

198.76 

24 




Decision: Do not reject Hq. 
11.40 The hypotheses are 


Hq : The regression is linear in x, 

H i : The regression is nonlinear in x. 


a = 0.05. 

Critical regions: / > 3.26 with 4 and 12 degrees of freedom. 
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Computations: SST = S yy = 3911.78, SSR = bS xy = 3805.89 and SSE = S yy — 

6 3 6 , 

SSR = 105.89. SSE^piiie) = Uij ~ = 69.33, and the “Lack-of-fit SS” is 

105.89 - 69.33 = 36.56. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

3805.89 

1 

3805.89 


Error 

105.89 

16 

6.62 


f Lack of fit 

f 36.56 

I 4 

f 9.14 

1.58 

[ Pure error 

\ 69.33 

1 12 

l 5.78 


Total 

3911.78 

17 




Decision: Do not reject Ef 0 ; the lack-of-ht test is insignificant. 
11.41 The hypotheses are 


H 0 : The regression is linear in x, 

Hi : The regression is nonlinear in x. 

a = 0.05. 

Critical regions: / > 3.00 with 6 and 12 degrees of freedom. 

Computations: SST = S yy = 5928.55, SSR = bS xy = 1219.35 and SSE = S yy — 

8 rii 8 2 

SSR = 4709.20. S'S'F(pure) = ufj ~ it = 3020.67, and the “Lack-of-fit SS” 
is 4709.20 - 3020.67 = 1688.53. * J 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

1219.35 

1 

1219.35 


Error 

4709.20 

18 

261.62 


f Lack of fit 

f 1688.53 

f 6 

( 281.42 

1.12 

| Pure error 

\ 3020.67 

l 12 

\ 251.72 


Total 

5928.55 

19 




Decision: Do not reject Ho ; the lack-of-fit test is insignificant. 

11.42 (a) t = 2.679 and 0.01 < P(T > 2.679) < 0.015, hence 0.02 < P-value < 0.03. There 
is a strong evidence that the slope is not 0. Hence emitter drive-in time influences 
gain in a positive linear fashion. 

(b) / = 56.41 which results in a strong evidence that the lack-of-fit test is significant. 
Hence the linear model is not adequate. 

(c) Emitter does does not influence gain in a linear fashion. A better model is a 
quadratic one using emitter drive-in time to explain the variability in gain. 
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11.43 

11.44 

11.45 


11.46 

11.47 


y = —21.0280 + 0.4072a;; /lof = 1.71 with a P-value = 0.2517. Hence, lack-of-fit test 
is insignificant and the linear model is adequate. 

(a) y = 0.011571 + 0.006462a; with t = 7.532 and P(T > 7.532) < 0.0005 Hence 
P-value < 0.001; the slope is significantly different from 0 in the linear regression 
model. 

(b) /lof = 14.02 with P-value < 0.0001. The lack-of-fit test is significant and the 
linear model does not appear to be the best model. 

(a) y = —11.3251 — 0.0449 temperature. 

(b) Yes. 

(c) 0.9355. 

(d) The proportion of impurities does depend on temperature. 



-270 -265 -260 

Temperature 


However, based on the plot, it does not appear that the dependence is in linear 
fashion. If there were replicates, a lack-of-fit test could be performed. 

(a) y = 125.9729 + 1.7337 population; P-value for the regression is 0.0023. 

(b) / 6,2 = 0.49 and P-value = 0.7912; the linear model appears to be adequate based 
on the lack-of-fit test. 

(c) /12 = 11.96 and P-value = 0.0744. The results do not change. The pure error 
test is not as sensitive because the loss of error degrees of freedom. 

(a) The figure is shown next. 

(b) y = -175.9025 + 0.0902 year; R 2 = 0.3322. 

(c) There is definitely a relationship between year and nitrogen oxide. It does not 
appear to be linear. 
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11.48 The ANOVA model is: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

135.2000 

1 

135.2000 


Error 

10.4700 

14 

0.7479 


f Lack of fit 

f 6.5150 

I 2 

f 3.2575 

9.88 

| Pure error 

1 3.9550 

l 12 

1 0.3296 


Total 

145.6700 

15 




The P-value = 0.0029 with / = 9.88. 

Decision: Reject H 0 ; the lack-of-fit test is significant. 


11.49 S xx = 36,354 - 35,882.667 = 471.333, S yy 
S xy = 36, 926 - 36, 810.667 = 115.333. So, r 


= 38, 254 - 37, 762.667 = 491.333, and 
= 7 115 = 0.240. 

( 471 . 333 ) ( 491 . 333 ) 


11.50 The hypotheses are 


H 0 : p = 0, 
Hr- 0 . 


a = 0.05. 

Critical regions: t < —2.776 or t > 2.776. 
Computations: t = ^======g = 0.51. 

Decision: Do not reject Po- 


ll. 51 Since b = we can write s 2 = 

2 _ Syy-r Syy _ (1~' )Syy ^ lieilCe 

71 — 71 — V ' 


Syv b l Sxx . Also, b = rJ so that 
n—2 ’ V b xx 
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11.52 (a) S xx = 128.6602 - 32.68 2 /9 = 9.9955, S yy = 7980.83 - 266.7 2 /9 = 77.62, and 

S xv = 990.268 - (32.68)(266.7)/9 = 21.8507. So, r = . 2L8507 = 0.784. 

V \ J\ )/ ’ x /(9.9955)(77.62) 

(b) The hypotheses are 

H 0 : p = 0, 

Hi : p > 0 . 

a = 0 . 01 . 

Critical regions: t > 2.998. 

Computations: t = = 3.34. 

Decision: Reject H 0 ; p > 0. 

(c) (0.784) 2 (100%) = 61.5%. 

11.53 (a) From the data of Exercise 11.1 we can calculate 

S xx = 26, 591.63 - (778.7) 2 /25 = 2336.6824, 

Syy = 172, 891.46 - (2050) 2 /25 = 4791.46, 

S xy = 65, 164.04 - (778.7) (2050)/25 = 1310.64. 

Therefore, r = , 13ia(>1 = 0.392. 

-y (2236.6824) (4791.46) 

(b) The hypotheses are 


H 0 : p = 0, 

Hi : p % 0. 


a = 0.05. 

Critical regions: t < —2.069 or t > 2.069. 

Computations: t = J 1 392 ^, = 2.04. 

Decision: Fail to reject Hq at level 0.05. However, the P-value = 0.0530 which is 
marginal. 

11.54 (a) From the data of Exercise 11.9 we find S xx = 244.26 — 45 2 /9 = 19.26, S yy = 
133,786 - 1094 2 /9 = 804.2222, and S xy = 5348.2 - (45)(1094)/9 = -121.8. So, 
r = =HM = -0 979 

\J (19.26)(804.2222) 

(b) The hypotheses are 


H 0 \ p= -0.5, 
Hi : p < —0.5. 


a = 0.025. 

Critical regions: z < —1.96. 


Computations: z — ^ In 
Decision: Reject 77 0 ; p < —0.5. 


(0.021)(1.5) 

(1.979)(0.5) 


4 99 
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(c) (— 0.979) 2 (100%) = 95.8%. 

11.55 Using the value s = 16.175 from Exercise 11.6 and the fact that % = 48.994 when 
Xq = 35, and x = 55.5, we have 

(a) 48. 994± (2. 101) (16. 175)^/1/20 + (-20.5) 2 /5495 which implies to 36.908 < p Y \ 35 < 
61.080. 

(b) 48.994 ± (2.101)(16.175)- v /l + 1/20 + (-20.5) 2 /5495 which implies to 12.925 < 
yo < 85.063. 

11.56 The fitted model can be derived as y — 3667.3968 — 47.3289a;. 

The hypotheses are 

H 0 : /3 = 0, 

: P % 0 . 

t = —0.30 with P-value = 0.77. Hence H 0 cannot be rejected. 

11.57 (a) S xx = 729.18 - 118.6 2 /20 = 25.882, S xy = 1714.62 - (118.6) (281.1)/20 = 47.697, 

so b = = 1.8429, and a = y — hx = 3.1266. Hence y = 3.1266 + 1.8429a;. 

(b) The hypotheses are 

Ho ; the regression is linear in x, 

H 1 ; the regression is not linear in x. 

a = 0.05. 

Critical region: / > 3.07 with 8 and 10 degrees of freedom. 

Computations: SST = 13.3695, SSR = 87.9008, SSE = 50.4687, SlSi^pure) = 
16.375, and Laek-of-ht SS = 34.0937. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

87.9008 

1 

87.9008 


Error 

50.4687 

18 

2.8038 


f Lack of fit 

f 34.0937 

I 8 

f 4.2617 

2.60 

| Pure error 

1 16.375 

l 10 

\ 1.6375 


Total 

138.3695 

19 




The P-value = 0.0791. The linear model is adequate at the level 0.05. 

11.58 Using the value s = 50.225 and the fact that y 0 = $448,644 when x 0 = $45, and 
x = $34,167, we have 

(a) 488.644 ± (1.812) (50.225) ^1/12 + which implies 452.835 < /j y , 45 < 

524.453. 
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(b) 488.644 ± (1.812) (50.225) ^/l + 1/12 + , which implies 390.845 < y 0 < 

586.443. 

11.59 (a) y = 7.3598 + 135.4034a;. 

(b) .SIS' (Pure Error) = 52,941.06; /lof = 0.46 with P-value = 0.64. The lack-of-fit 
test is insignificant. 

(c) No. 

11.60 (a) S xx = 672.9167, S vv = 728.25, S xv = 603.75 and r = . tm ~" 1 = 0.862, 

W XX ’ Vy ’ Xy \J (672.9167)(728.25) ’ 

which means that (0.862) 2 (100%) = 74.3% of the total variation of the values of 
Y in onr sample is accounted for by a linear relationship with the values of X. 

(b) To estimate and test hypotheses on p, X and Y are assumed to be random 
variables from a bivariate normal distribution. 

(c) The hypotheses are 


H 0 : p = 0.5, 
Hi : p > 0.5. 


a = 0 . 01 . 

Critical regions: z > 2.33. 
Computations: z — ^ In 


(1.862)(0.5) 
(0.138) (1.5) 


Decision: Reject H 0 ] p > 0.5. 


2.26. 


n 

E ivi-yi) 2 

11.61 s 2 = 1-1 n _ 2 — . Using the centered model, = y + b{xi — x) + e^. 


n 

(■ n - 2 )E(S 2 ) = E^[a + /3(xi - x) + 6i - (y + b(xi - x))] 2 

i=l 

n 

= y ^E\(a- y) 2 + (P- b) 2 (xi - x) 2 + e- - 2 b(xi - x)ei - 2 ye r ] , 

i= 1 

(other cross product terms go to 0) 

na 2 cr 2 S xx 2 cr 2 S xx na 2 

= + ^ — + na -2— 2 

n S xx S xx n 

= (n — 2)a 2 . 


11.62 (a) The confidence interval is an interval on the mean sale price for a given buyer’s 

bid. The prediction interval is an interval on a future observed sale price for a 
given buyer’s bid. 

(b) The standard errors of the prediction of sale price depend on the value of the 
buyer’s bid. 
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(c) Observations 4, 9, 10, and 17 have the lowest standard errors of prediction. These 
observations have buyer’s bids very close to the mean. 

11.63 (a) The residual plot appears to have a pattern and not random scatter. The R 2 is 

only 0.82. 

(b) The log model has an R 2 of 0.84. There is still a pattern in the residuals. 

(c) The model using gallons per 100 miles has the best R 2 with a 0.85. The residuals 
appear to be more random. This model is the best of the three models attempted. 
Perhaps a better model could be found. 

11.64 (a) The plot of the data and an added least squares fitted line are given here. 



Temperature 

(b) Yes. 

(c) y = 61.5133 + 0.1139z. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Regression 

486.21 

1 

486.21 


Error 

24.80 

10 

2.48 


f Lack of fit 

f 3.61 

I 2 

f 1.81 

0.68 

| Pure error 

1 21.19 

l § 

1 2.65 


Total 

511.01 

11 




The P-value = 0.533. 

(d) The results in (c) show that the linear model is adequate. 

11.65 (a) y = 90.8904 — 0.0513a:. 

(b) The t - value in testing Hq : (3 — 0 is —6.533 which results in a P-value < 0.0001. 
Hence, the time it takes to run two miles has a significant influence on maximum 
oxygen uptake. 

(c) The residual graph shows that there may be some systematic behavior of the 
residuals and hence the residuals are not completely random 
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11.66 Let Y* — Y i — a, for i — 1, 2, . . . , n. The model Y t — a + /3xi + e; is equivalent to 
Y* = /3xi + £j. This is a “regression through the origin” model that is studied in 
Exercise 11.32. 

(a) Using the result from Exercise 11.32(a), we have 


E x i ( Vi - «) E x iVi ~ nxa 



i=l i = 1 


(b) Also from Exercise 11.32(b) we have . 

E *1 

i= 1 


n 

11.67 SSE = YhiVi ~ P x i) 2 - Taking derivative with respect to (3 and setting this as 0, we 

i= 1 

n n 

get ^2xi(yi — bxi ) = 0, or ^x^i/i — iji) = 0. This is the only equation we can get 

i = 1 i = 1 

n 

using the least squares method. Hence in general, (y, — y j) = 0 does not hold for a 

i= 1 

regression model with zero intercept. 


11.68 No solution is provided. 




Chapter 12 

Multiple Linear Regression and 
Certain Nonlinear Regression Models 

12.1 (a) )/ = 27.5467 + 0.9217xi + 0.2842x 2 . 

(b) When x\ = 60 and x 2 = 4, the predicted value of the chemistry grade is 
y = 27.5467+ (0.9217)(60) + (0.2842)(4) = 84. 

12.2 y = -3.3727 + 0.0036xi + 0.9476x 2 . 

12.3 y = 0.7800 + 2.7122xi + 2.0497x 2 . 

12.4 (a) y = -22.99316 + 1.39567xi + 0.21761x 2 . 

(b) y = -22.99316 + (1.39567)(35) + (0.21761)(250) = 80.25874. 

12.5 (a) y = 56.46333 + 0.15253x - 0.00008x 2 . 

(b) y = 56.46333 + (0.15253) (225) - (0.00008) (225) 2 = 86.73333%. 

12.6 (a) d = 13.35875 - 0.33944u - 0.01183u 2 . 

(b) (1 = 13.35875 - (-0.33944) (70) - (0.01183) (70) 2 = 47.54206. 

12.7 y = 141.61178 - 0.28193x + 0.00031x 2 . 

12.8 (a) y = 19.03333 + 1.0086a; - 0.02038x 2 . 

(b) SSE = 24.47619 with 12 degrees of freedom and .SIS' (pure error) = 24.36667 
with 10 degrees of freedom. So, SSLOF = 24.47619 — 24.36667 = 0.10952 with 
2 degrees of freedom. Hence / = 2 ° 36667/10 = 0-^2 with a P-value of 0.9778. 
Therefore, there is no lack of fit and the quadratic model fits the data well. 

12.9 (a) y = -102.71324 + 0.60537xi + 8.92364x 2 + 1.43746x s + 0.01361x 4 . 

(b) y = -102.71324+ (0.60537) (75) + (8.92364) (24) + (1.43746) (90) + (0.01361) (98) = 
287.56183. 
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12.10 (a) y = 1.07143 + 4.60317a; - 1.84524a; 2 + 0.19444a; 3 . 

(b) y = 1.07143 + (4.60317)(2) - (1.84524)(2) 2 + (0.19444)(2) 3 = 4.45238. 

12.11 y = 3.3205 + 0.42105xi - 0.29578x 2 + 0.01638x 3 + 0.12465x 4 . 

12.12 y = 1, 962.94816 - 15.85168xi + 0.05593x 2 + 1.58962x 3 - 4.21867x 4 - 394.31412x 5 . 

12.13 y = -6.51221 + 1.99941xi - 3.67510x 2 + 2.52449x 3 + 5.15808x 4 + 14.40116x 5 . 

12.14 y = -884.667 - 3 - 0.83813xi + 4.90661x 2 + 1.33113x3 + 11.93129x 4 . 

12.15 (a) y = 350.99427 - 1.27199a;i - 0.15390x 2 . 

(b) y = 350.99427- (1.27199)(20) - (0.15390)(1200) = 140.86930. 

12.16 (a) y = -21.46964 - 3.32434xi + 0.24649x 2 + 20.34481x 3 . 

(b) y = -21.46964 - (3.32434) (14) + (0.24649) (220) + (20.34481) (5) = 87.94123. 

12.17 s 2 = 0.16508. 

12.18 s 2 = 0.43161. 

12.19 s 2 = 242.71561. 

12.20 Using SAS output, we obtain 

= 3.747 x 10” 7 , d'l 2 = 0.13024, d blb2 = -4.165 x 10” 7 . 

12.21 Using SAS output, we obtain 

(a) dl 2 = 28.09554. 

(b) d- bl b 4 = —0.00958. 

12.22 Using SAS output, we obtain 

0.4516 < Py\ x1 = 9 oo,x 2 =i < 1-2083, and —0.1640 < y 0 < 1.8239. 

12.23 Using SAS output, we obtain a 90% confidence interval for the mean response when 
x = 19.5 as 29.9284 < p Y \x=i 9.5 < 31.9729. 

12.24 Using SAS output, we obtain 

263.7879 < p Y ^= 75 ^= 24 , * 3 = 90,* 4 =98 < 311.3357, and 243.7175 <y 0 < 331.4062. 

12.25 The hypotheses are 

Hq : 02 = 0; 

77, : fh % 0. 


The test statistic value is t — 2.86 with a P-value = 0.0145. Hence, we reject Ho and 
conclude /3 2 % 0. 
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12.26 The test statistic is t — 0 0 0 0 0 0 0 3 6 6 1 2 2 = 5.91 with P-value = 0.0002. Reject H 0 and claim 
that f3\ ^ 0. 

12.27 The hypotheses are 

H 0 : (h = 2, 

Hi: 2. 

The test statistics is t — 2 q 20209 2 = 3-524 with P-value = 0.0097. Reject H 0 and 
conclude that f3\ % 2. 

12.28 Using SAS output, we obtain 

(a) s 2 = 650.1408. 

(b) y = 171.6501, 135.8735 < A t y| Xl =2o,x 2 =iooo < 207.4268, and 
82.9677 <y 0 < 260.3326. 

12.29 (a) P-value = 0.3562. Hence, fail to reject H 0 . 

(b) P-value = 0.1841. Again, fail to reject Hq. 

(c) There is not sufficient evidence that the regressors X\ and X 2 significantly influence 
the response with the described linear model. 

12.30 (a) s 2 = 17.22858. 

(b) y = 104.9617 and 95.5660 < y 0 < 114.3574. 

12.31 P 2 = HP = = 99.97%. Hence, 99.97% of the variation in the response Y in our 

sample can be explained by the linear model. 

12.32 The hypotheses are: 

Ho : (3i = @2 — 0 , 

Hi : At least one of the /3/s is not zero, for i = 1, 2. 

Using the /-test, we obtain that / = H|| = ^J^ 6 ^ 29 = 12688.7 with P-value < 

0.0001. Hence, we reject Hq. The regression explained by the model is significant. 

12.33 / = 5.11 with P-value = 0.0303. At level of 0.01, we fail to reject H 0 and we cannot 
claim that the regression is significant. 

12.34 The hypotheses are: 

Ho '■ Pi = @2 = 0 , 

H 1 : At least one of the /3/s is not zero, for i = 1,2. 
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The partial /-test statistic is 


/ = 


(160.93598- 145.88354) /2 
1.49331 


= 5.04, with 2 and 7 degrees of freedom. 


The resulting P-value = 0.0441. Therefore, we reject H 0 and claim that at least one 
of Pi and p 2 is not zero. 


12.35 / = (h ' 90 °o 9 16 5 os 811 ^ 1 = 34.91 with 1 and 9 degrees of freedom. The P-value = 0.0002 
which implies that H 0 is rejected. 


12.36 (a) y = 0.900 + 0.575a;i + 0.550x 2 + 1.150x 3 . 

(b) For the model in (a), SSR = 15.645, SSE = 1.375 and SST = 17.020. The 
ANOVA tabic for all these single-degree-of- freedom components can be displayed 
as: 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Xi 

1 

2.645 

7.69 

0.0501 

%2 

1 

2.420 

7.04 

0.0568 

X 3 

1 

10.580 

30.78 

0.0052 

Error 

4 

0.34375 



Total 

7 


P 3 is found to be significant at the 0.01 level and Pi and p 2 are not significant. 


12.37 The hypotheses are: 


Hq : Pi — P 2 — 0 , 

Hi : At least one of the /3/s is not zero, for i = 1,2. 


The partial /-test statistic is 


/ = 


(4957.24074- 17.02338)/2 
242.71561 


= 10.18, with 2 and 7 degrees of freedom. 


The resulting P-value = 0.0085. Therefore, we reject Hq and claim that at least one 
of Pi and p 2 is not zero. 

12.38 Using computer software, we obtain the following. 


R(Pi | p 0 ) = 2.645, 

R(Pi I Po, P 2 , P 3 ) — R(Po j Pi, P 2 , P 3 ) ~ R(Po , P 2 , P 3 ) = 15.645 — 13.000 = 2.645. 

R(P 2 I Po,Pi) = R(Po, Pi, P 2 ) - R(Po,Pi) = 5.065 - 2.645 = 2.420, 

R(02 I Po,Pi,Ps) = R(Po, Pi, P 2 , P 3 ) ~ R(Po,Pi,P 3 ) = 15.645 - 13.225 = 2.420, 
R(Ps | Po,Pi,P2 ) = R(P 0 ,Pi,P 2 ,P3)-R(Po,Pi,P2) = 15.645 -5.065 = 10.580. 



Solutions for Exercises in Chapter 12 


175 


12.39 The following is the summary. 



s 2 

R 2 

d 2 

-^adi 

The model using weight alone 

8.12709 

0.8155 

0.8104 

The model using weight and drive ratio 

4.78022 

0.8945 

0.8885 


The above favor the model using both explanatory variables. Furthermore, in the 
model with two independent variables, the t-test for /? 2 , the coefficient of drive ratio, 
shows P-value < 0.0001. Hence, the drive ratio variable is important. 

12.40 The following is the summary: 



s 2 

C.V. 

td2 

-^adi 

Average Length of the CIs 

The model with x 3 

4.29738 

7.13885 

0.8823 

5.03528 

The model without x 3 

4.00063 

6.88796 

0.8904 

4.11769 


These numbers favor the model without using x 3 . Hence, variable x 3 appears to be 
unimportant. 


12.41 The following is the summary: 



s 2 

C.V. 

td2 

^adj 

The model with 3 terms 

0.41966 

4.62867 

0.9807 

The model without 3 terms 

1.60019 

9.03847 

0.9266 


Furthermore, to test (3n = j3 12 = P 22 = 0 using the full model, / = 15.07 with 
P-value = 0.0002. Hence, the model with interaction and pure quadratic terms is 
better. 

12.42 (a) Full model: y = 121.75 + 2.50xi + 14.75x2 + 21.75x 3 , with P 2 dj = 0.9714. 

Reduced model: y = 121.75 + 14.75x 2 + 21.75x 3 , with P 2 dj = 0.9648. 

There appears to be little advantage using the full model. 

(b) The average prediction interval widths are: 

full model: 32.70; and reduced model: 32.18. Hence, the model without using X\ 
is very competitive. 

12.43 The following is the summary: 



S' 2 

C.V. 

td2 

^adi 

Average Length of the CIs 

Xi, x 2 

650.14075 

16.55705 

0.7696 

106.60577 

Xi 

967.90773 

20.20209 

0.6571 

94.31092 

X 2 

679.99655 

16.93295 

0.7591 

78.81977 


In addition, in the full model when the individual coefficients are tested, we obtain 
P-value = 0.3562 for testing /?i = 0 and P-value = 0.1841 for testing /3 2 = 0. 

In comparing the three models, it appears that the model with x 2 only is slightly 
better. 
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12.44 Here is the summary for all four models (including the full model) 



s 2 

C.V. 

c>2 

-^adi 

X\, X 2 , X 3 

17.22858 

3.78513 

0.9899 

Xi, X 2 

297.97747 

15.74156 

0.8250 

Xi, x 3 

17.01876 

3.76201 

0.9900 

x 2 , X 3 

17.07575 

3.76830 

0.9900 


It appears that a two-variable model is very competitive with the full model as long as 
the model contains x 3 . 

12.45 (a) y = 5.95931-0.00003773 odometer +0.33735 octane -12.62656 van -12.98455 suv. 

(b) Since the coefficients of van and suv are both negative, sedan should have the best 
gas mileage. 

(c) The parameter estimates (standard errors) for van and suv are —12.63 (1.07) and 
—12,98 (1.11), respectively. So, the difference between the estimates are smaller 
than one standard error of each. So, no significant difference in a van and an suv 
in terms of gas mileage performance. 

12.46 The parameter estimates are given here. 


Variable 

DF 

Estimate 

Standar Error 

t 

P-value 

Intercept 

1 

-206.64625 

163.70943 

-1.26 

0.2249 

Income 

1 

0.00543 

0.00274 

1.98 

0.0649 

Family 

1 

-49.24044 

51.95786 

-0.95 

0.3574 

Female 

1 

236.72576 

110.57158 

2.14 

0.0480 


(a) y = —206.64625 + 0.00543Income — 49.24044Family + 236/72576Female. The 
company would prefer female customers. 

(b) Since the P-value = 0.0649 for the coefficient of the “Income,” it is at least 
marginally important. Note that the R 2 = 0.3075 which is not very high. Perhaps 
other variables need to be considered. 

12.47 (a) HangTime = 1.10765 + 0.01370 LLS + 0.00429 Power. 

(b) HangTime = 1.10765 + (0.01370)(180) + (0.00429) (260) = 4.6900. 

(c) 4.4502 < gUang Time | LLS=180, Power=260 < 4.9299. 

12.48 (a) For forward selection, variable X\ is entered first, and no other variables are entered 

at 0.05 level. Hence the final model is y = —6.33592 + 0. 337380 + 

(b) For the backward elimination, variable x 3 is eliminated first, then variable x 4 and 
then variable x 2 , all at 0.05 level of significance. Hence only x\ remains in the 
model and the final model is the same one as in (a). 
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(c) For the stepwise regression, after x\ is entered, no other variables are entered. 
Hence the final model is still the same one as in (a) and (b). 


12.49 Using computer output, with a = 0.05, x 4 was removed first, and then X\ . Neither X 2 
nor X 3 were removed and the final model is y — 2.18332 + 0.95758x2 + 3.32533x3. 

12.50 (a) y = -29.59244 + 0.27872xi + 0.06967x 2 + 1.24195x 3 - 0.39554x 4 + 0.22365x 5 . 

(b) The variables X 3 and X 5 were entered consecutively and the final model is y = 
-56.94371 + 1.63447x3 + 0.24859x 5 . 

(c) We have a summary table displayed next. 


Model 

s 2 

PRESS 

R 2 

EW 

i 

x 2 x 5 

176.735 

2949.13 

0.7816 

151.681 

xix 5 

174.398 

3022.18 

0.7845 

166.223 

X 1 X 3 X 5 

174.600 

3207.34 

0.8058 

174.819 

X 3 X 5 

194.369 

3563.40 

0.7598 

189.881 

X 3 X 4 X 5 

192.006 

3637.70 

0.7865 

190.564 

X 2 X 4 X 5 

196.211 

3694.97 

0.7818 

170.847 

X 2 X 3 X 5 

186.096 

3702.90 

0.7931 

184.285 

X 3 X 4 

249.165 

3803.00 

0.6921 

192.172 

xix 2 x 5 

184.446 

3956.41 

0.7949 

189.107 

x 5 

269.355 

3998.77 

0.6339 

189.373 

x 3 

257.352 

4086.47 

0.6502 

199.520 

X 2 X 3 X 4 X 5 

197.782 

4131.88 

0.8045 

192.000 

Xl 

274.853 

4558.34 

0.6264 

202.533 

X 2 X 3 

264.670 

4721.55 

0.6730 

210.853 

xix 3 

226.777 

4736.02 

0.7198 

219.630 

X 1 X 3 X 4 X 5 

188.333 

4873.16 

0.8138 

207.542 

X2 

328.434 

4998.07 

0.5536 

217.814 

X 4 X 5 

289.633 

5136.91 

0.6421 

209.232 

X 1 X 2 X 3 X 5 

195.344 

5394.56 

0.8069 

216.934 

X 2 X 3 X 4 

269.800 

5563.87 

0.7000 

234.565 

XiX 2 

297.294 

5784.20 

0.6327 

231.374 

X 1 X 4 X 5 

192.822 

5824.58 

0.7856 

216.583 

X 1 X 3 X 4 

240.828 

6564.79 

0.7322 

248.123 

X 2 X 4 

352.781 

6902.14 

0.5641 

248.621 

X 1 X 2 X 4 X 5 

207.477 

7675.70 

0.7949 

249.604 

X 1 X 2 X 3 X 4 X 5 

214.602 

7691.30 

0.8144 

257.732 

X 1 X 4 

287.794 

7714.86 

0.6444 

249.221 

X 1 X 2 X 3 

249.038 

7752.69 

0.7231 

264.324 

X 4 

613.411 

8445.98 

0.1663 

259.968 

X 1 X 2 X 3 X 4 

266.542 

10089.94 

0.7365 

297.640 

X 1 X 2 X 4 

317.783 

10591.58 

0.6466 

294.044 
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(d) It appears that the model with X 2 = LLS and x b = Power is the best in terms of 
PRESS, s 2 , and |^|, 

i 

12.51 (a) y = -587.21085 + 428.43313a;. 

(b) y = 1180.00032 - 192.69121a; + 35.20945a; 2 . 

(c) The summary of the two models are given as: 


Model 

s 2 

R 2 

PRESS 

fly = A) + Pix 

1,105,054 

0.8378 

18,811,057.08 

l-iy = (do + (3ix + fa ix 2 

430,712 

0.9421 

8,706,973.57 


It appears that the model with a quadratic term is preferable. 

12.52 The parameter estimate for (3^ is 0.22365 with a standard error of 0.13052. Hence, 
t = 1.71 with P-value = 0.6117. Fail to reject Hq. 


12.53 = 20,588.038, d 2 bll = 62.650, and <x 6l6ll = -1, 103.423. 

12.54 (a) The following is the summary of the models. 


Model 

s 2 

R 2 

PRESS 

c p 

X 2 X 3 

8094.15 

0.51235 

282194.34 

2.0337 

X2 

8240.05 

0.48702 

282275.98 

1.5422 

xix 2 

8392.51 

0.49438 

289650.65 

3.1039 

X1X2X3 

8363.55 

0.51292 

294620.94 

4.0000 

x 3 

8584.27 

0.46559 

297242.74 

2.8181 

Xl 

8727.47 

0.45667 

304663.57 

3.3489 

xix 3 

8632.45 

0.47992 

306820.37 

3.9645 


(b) The model with lnfb^) appears to have the smallest C p with a small PRESS. 
Also, the model ln(x 2 ) and ln(x 3 ) has the smallest PRESS. Both models appear 
to better than the full model. 


12.55 (a) There are many models here so the model summary is not displayed. By using 

MSE criterion, the best model, contains variables x\ and X 3 with s 2 = 313.491. 
If PRESS criterion is used, the best model contains only the constant term with 
s 2 = 317.51. When the C p method is used, the best model is model with the 
constant term. 


(b) The normal probability plot, for the model using intercept only, is shown next. 
We do not appear to have the normality. 



- 2-10 1 2 


Theoretical Quantiles 
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12.56 (a) Volt = —1.64129 + 0.000556 Speed — 67.39589 Extension. 

(b) P - values for the f-tests of the coefficients are all < 0.0001. 

(c) The R 2 = 0.9607 and the model appears to have a good fit. The residual plot 
and a normal probability plot are given here. 

Normal Q-Q Plot 



12.57 (a) y = 3.13682 + 0.64443a;i - 0.01042x 2 + 0.50465x 3 - 0.11967x 4 - 2.46177x 5 + 
1.50441 x 6 . 

(b) The final model using the stepwise regression is 

y = 4.65631 + 0.51133x 3 - 0.12418x 4 . 

(c) Using C p criterion (smaller the better), the best model is still the model stated in 
(b) with s 2 = 0.73173 and R 2 = 0.64758. Using the s 2 criterion, the model with 
X \ , x 3 and x 4 has the smallest value of 0.72507 and R 2 = 0.67262. These two 
models are quite competitive. However, the model with two variables has one less 
variable, and thus may be more appealing. 

(d) Using the model in part (b), displayed next is the Studentized residual plot. Note 
that observations 2 and 14 are beyond the 2 standard deviation lines. Both of 
those observations may need to be checked. 



12.58 The partial U-test shows a value of 0.82, with 2 and 12 degrees of freedom. Conse- 
quently, the P - value = 0.4622, which indicates that variables x\ and xq can be excluded 
from the model. 






180 


Chapter 12 Multiple Linear Regression and Certain Nonlinear Regression Models 


12.59 


12.60 


12.61 


(a) y = 125.86555 + 7.75864a;i + 0.09430x 2 - 0.00919 xix 2 . 

(b) The following is the summary of the models. 


Model 

s 2 

R 2 

PRESS 

c p 

X2 

680.00 

0.80726 

7624.66 

2.8460 

X\ 

967.91 

0.72565 

12310.33 

4.8978 

xix 2 

650.14 

0.86179 

12696.66 

3.4749 

X 1 X 2 X 3 

561.28 

0.92045 

15556.11 

4.0000 


It appears that the model with x 2 alone is the best. 

(a) The fitted model is y = 85.75037- 15.93334xi + 2.42280x 2 + 1.82754x 3 + 3.07379x 4 . 

(b) The summary of the models are given next. 


Model 

s 2 

PRESS 

R 2 

c p 

X1X2X4 

9148.76 

447, 884.34 

0.9603 

3.308 

X4 

19170.97 

453,304.54 

0.8890 

8.831 

X3X4 

21745.08 

474,992.22 

0.8899 

10.719 

X1X2X3X4 

10341.20 

482,210.53 

0.9626 

5.000 

X1X4 

10578.94 

488,928.91 

0.9464 

3.161 

X2X4 

21630.42 

512,749.78 

0.8905 

10.642 

X2X3X4 

25160.18 

532,065.42 

0.8908 

12.598 

X1X3X4 

12341.87 

614,553.42 

0.9464 

5.161 

X2 

160756.81 

1,658,507.38 

0.0695 

118.362 

X3 

171264.68 

1,888,447.43 

0.0087 

126.491 

X2X3 

183701.86 

1,896,221.30 

0.0696 

120.349 

X\ 

95574.16 

2,213,985.42 

0.4468 

67.937 

X4X3 

107287.63 

2,261,725.49 

0.4566 

68.623 

X4X2 

109137.20 

2,456,103.03 

0.4473 

69.875 

XXX2X3 

125126.59 

2,744,659.14 

0.4568 

70.599 


When using PRESS as well as the s 2 criterion, a model with x±, x 2 and x\ appears 
to be the best, while when using the C p criterion, the model with x\ and X 4 is the 
best. When using the model with xi, x 2 and X4, we find out that the P- value for 
testing fa = 0 is 0.1980 which implies that perhaps x 2 can be excluded from the 
model. 

(c) The model in part (b) has smaller C p as well as competitive PRESS in comparison 
to the full model. 

n 

Since H = X(X‘X)~ 1 X l , and h,i = tr (H), we have 

i = 1 

n 

Ki = tr(X(X L X)~ l X L ) = tr(X‘X(X‘X) _1 ) = tr(/ p ) = p, 

i= 1 

where I p is the p x p identity matrix. Here we use the property of tr(AB) = tr (BA) 
in linear algebra. 
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12.62 (a) y = 9.9375 + 0.6125a;i + 1.3125x2 + 1.4625x 3 . 

(b) The ANOVA table for all these singlc-degree-of- freedom components can be dis- 
played as: 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Xi 

1 

3.00125 

0.95 

0.3847 


1 

13.78125 

4.37 

0.1049 


1 

17.11125 

5.42 

0.0804 

Error 

4 

3.15625 



Total 

7 


Only X3 is near significant. 

12.63 (a) For the completed second-order model, we have 

n 

PRESS = 9, 657, 641.55, | | = 5, 211.37. 

2—1 

(b) When the model does not include any term involving 24, 

n 

PRESS = 6, 954.49, ^ | y t - fa ,_*| = 277.292. 

2—1 

Apparently, the model without X 4 is much better. 

(c) For the model with x\. 

n 

PRESS = 312, 963.71, | - fa_i\ = 762.57. 

2=1 

For the model without X 4 : 

n 

PRESS = 3, 879.89, ^ \y t - fa t _i\ = 220.12 

2=1 

n 

Once again, the model without X 4 performs better in terms of PRESS and \y— 

2=1 

Vi,—i I • 

12.64 (a) The stepwise regression results in the following fitted model: 

y = 102.20428 - 0.21962xi - 0.0723 - x 2 - 2.68252x 3 - 0.37340x 5 + 0.30491x 6 . 

(b) Using the C p criterion, the best model is the same as the one in (a). 

12.65 (a) Yes. The orthogonality is maintained with the use of interaction terms. 
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(b) No. There are no degrees of freedom available for computing the standard error. 

12.66 The fitted model is y — 26. 19333+0. 04772x1+0.76011x2— 0.00001334a;ii — 0.00687x22 + 
0.00011333x12- The t-tests for each coefficient show that X 12 and X 22 may be eliminated. 
So, we ran a test for /3 12 = P 22 = 0 which yields P-value = 0.2222. Therefore, both X 12 
and x 2 2 may be dropped out from the model. 

12.67 (a) The fitted model is y = -0.26891 + 0.07755xi + 0.02532x 2 - 0.03575x 3 . The 

/-value of the test for H 0 : j3\ = /3 2 = @3 = 0 is 35,28 with P-value < 0.0001. 
Hence, we reject H 0 . 

(b) The residual plots are shown below and they all display random residuals. 



0 2 4 6 8 

x3 


(c) The following is the summary of these three models. 


Model 

PRESS 

c p 

Xi,X 2 ,X 3 

Xi,X 2 ,X 3 ,Xi,X 2 ,x| 

Xi, x 2 , x 3 , x\, x|, x|, X12, X 13 , x 23 

0.091748 

0.08446 

0.088065 

24.7365 

12.3997 

10 


It is difficult to decide which of the three models is the best. Model I contains 
all the significant variables while models II and III contain insignificant variables. 
However, the C p value and PRESS for model are not so satisfactory. Therefore, 
some other models may be explored. 

12.68 Denote by Z\ — 1 when Group=l, and Z 1 = 0 otherwise; 

Denote by Z 2 = 1 when Group=2, and Z 2 = 0 otherwise; 

Denote by Z 3 = 1 when Group=3, and Z 3 = 0 otherwise; 
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12.69 


12.70 


12.71 


12.72 


(a) The parameter estimates are: 


Variable 

DF 

Parameter Estimate 

P-value 

Intercept 

1 

46.34694 

0.0525 

BMI 

1 

-1.79090 

0.0515 

Zl 

1 

-23.84705 

0.0018 

Z2 

1 

-17.46248 

0.0109 


Yes, Group I has a mean change in blood pressure that was significantly lower 
than the control group. It is about 23.85 points lower. 

(b) The parameter estimates are: 


Variable 

DF 

Parameter Estimate 

P-value 

Intercept 

1 

28.88446 

0.1732 

BMI 

1 

-1.79090 

0.0515 

Zl 

1 

-6.38457 

0.2660 

Z 3 

1 

17.46248 

0.0109 


Although Group I has a mean change in blood pressure that was 6.38 points lower 
than that of Group II, the difference is not very significant clue to a high P-value. 

(b) All possible regressions should be run. P 2 = 0.9908 and there is only one signifi- 
cant variable. 

(c) The model including x-i , x 3 and x$ is the best in terms of C p , PRESS and has all 
variables significant. 

Using the formula of P 2 d j on page 467, we have 


SSE/(n -k-1) MSE 

SST/(n- 1) ““ ~~ MST ' 


Since MST is fixed, maximizing P 2 d j is thus equivalent to minimizing MSE. 

(a) The fitted model is p= 1+e2 . 762 o-o.o 30 to • 

(b) The x 2 -values for testing 6 0 = 0 and b\ — 0 are 471.4872 and 243.4111, respec- 
tively. Their corresponding P-values are < 0.0001. Hence, both coefficients are 
significant. 

(c) ED 50 = = 89.675. 

(a) The fitted model is p = 1+e2 . 9 949 -o.o 308 . • 

(b) The increase in odds of failure that results by increasing the load by 20 lb/in. 2 is 

e (20)(0.0308) = Lg515 _ 
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13.1 Using the formula of SSE, we have 

k n k n 


SSE = XM' - = - u.) 2 = 

i— 1 j = 1 2= 1 j = 1 2=1 


E4- 

. 1=1 


ne- 


Hence 


E(SSE) = J2 


■i = i Ll=i 


E E (< 






k r 


E 

j=i 


cr 


na 


n- 


n 


= k(n — l)a 2 


Thus E 


SSE 
k(n— 1) 


fc(n— 1)(7 2 2 

fc(n— 1) ^ 


fc fc 

13.2 Since SSA = n (y t . — y..) 2 = n ^ — kny 2 , yij ~ n(y, y + aj,cr 2 ), and hence 

2=1 2=1 


Vi. ~ n (y; h + a*, and y„ ~ n (y + a, , then 


and 


£(y 2 ) = Var{yi) + [F(y,:. )] 2 = — + (y + a*) 2 , 

n 


E[y 2 } = f+(y + a ) 2 = f + y 2 , 

kn kn 


due to the constraint on the a’s. Therefore, 

k k 

E(SSA) = n E(y 2 ) — knE(y 2 ) = kcr 2 + n + a*) 2 — (u 2 + kny 2 ) 
2=1 

k 

= (k- l)a 2 + n^a 2 . 


2=1 


2=1 
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13.3 The hypotheses are 

H 0 : pi — p 2 — ■ ■ ■ — P6, 

H i : At least two of the means are not equal. 


a = 0.05. 

Critical region: / > 2.77 with V\ = 5 and v 2 = 18 degrees of freedom. 
Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Treatment 

5.34 

5 

1.07 

0.31 

Error 

62.64 

18 

3.48 


Total 

67.98 

23 




with P-value=0.9024. 

Decision: The treatment means do not differ significantly. 

13.4 The hypotheses are 


Hq : Pl — P2 ~ ' ' ' — p5r 

Hi : At least two of the means are not equal. 


a = 0.05. 

Critical region: / > 2.87 with V\ = 4 and v 2 = 20 degrees of freedom. 
Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Tablets 

78.422 

4 

19.605 

6.59 

Error 

59.532 

20 

2.977 


Total 

137.954 

24 




with P-value=0.0015. 

Decision: Reject Hq. The mean number of hours of relief differ significantly. 

13.5 The hypotheses are 

Hq : pi = P 2 = P3, 

H i : At least two of the means are not equal. 


a = 0 . 01 . 

Computation: 
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Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Shelf Height 

399.3 

2 

199.63 

14.52 

Error 

288.8 

21 

13.75 


Total 

688.0 

23 




with P-value=0.0001. 

Decision: Reject H 0 . The amount of money spent on dog food differs with the shelf 
height of the display. 

13.6 The hypotheses are 

Hq : Ha = I^b = hcs 

Hi : At least two of the means are not equal. 


a = 0 . 01 . 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Drugs 

158.867 

2 

79.433 

5.46 

Error 

393.000 

27 

14.556 


Total 

551.867 

29 




with P-value=0.0102. 

Decision: Since a = 0.01, we fail to reject Hq. However, this decision is very marginal 
since the P-value is very close to the significance level. 

13.7 The hypotheses are 

Ho '■ Ei = — AD = /H, 

H i : At least two of the means are not equal. 

a = 0.05. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Treatments 

119.787 

3 

39.929 

2.25 

Error 

638.248 

36 

17.729 


Total 

758.035 

39 




with P-value=0.0989. 

Decision: Fail to reject H 0 at level a = 0.05. 
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13.8 The hypotheses are 

H 0 : p t = p 2 = P'3-i 

Hi : At least two of the means are not equal. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Solvents 

3.3054 

2 

1.6527 

24.51 

Error 

1.9553 

29 

0.0674 


Total 

5.2608 

31 




with P-value< 0.0001. 

Decision: There is significant difference in the mean sorption rate for the three solvents. 
The mean sorption for the solvent Chloroalkanes is the highest. We know that it is 
significantly higher than the rate of Esters. 

13.9 The hypotheses are 

H 0 : pi = P 2 = P3 = P&i 

Hi : At least two of the means are not equal. 

a = 0 . 01 . 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Treatments 

27.5506 

3 

9.1835 

8.38 

Error 

18.6360 

17 

1.0962 


Total 

46.1865 

20 




with P-value= 0.0012. 

Decision: Reject H 0 . Average specific activities differ. 


13.10 S 50 = 3.2098, S 100 = 4.5253, S 200 = 5.1788, and S 400 = 3.6490. Since the sample sizes 
are all the same, 


4=4 


tX>? = 17 - 729L 


2=1 


Therefore, the Bartlett’s statistic is 


b = 


4 \ V4 


n* 

2=1 


0.9335. 
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Using Table A. 10, the critical value of the Bartlett’s test with k — 4 and a = 0.05 is 
0.7970. Since b > 0.7970, we fail to reject H 0 and hence the variances can be assumed 
equal. 

13.11 Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

B vs. A, C, D 

30.6735 

1 

30.6735 

14.28 

C vs. A, D 

49.9230 

1 

49.9230 

23.23 

A vs. D 

5.3290 

1 

5.3290 

2.48 

Error 

34.3800 

16 

2.1488 



(a) P-value=0.0016. B is significantly different from the average of A, C , and D. 

(b) P-value=0.0002. C is significantly different from the average of A and D. 

(c) P-value=0.1349. A can not be shown to differ significantly from D. 


13.12 (a) The hypotheses are 


H o : /i29 — P54 — h84 1 

Hi : At least two of the means are not equal. 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Protein Levels 

32,974.87 

2 

16,487.43 

5.15 

Error 

28,815.80 

9 

3,201.76 


Total 

61,790.67 

11 




with P- value— 0.0323. 

Decision: Reject Hq. The mean nitrogen loss was significantly different for the 
three protein levels. 

(b) For testing the contrast L = 2^29 ~ R 54 — h 84 at level a = 0.05, we have SSw = 
31,576.42 and / = 9.86, with P-value=0.0119. Hence, the mean nitrogen loss 
for 29 grams of protein was different from the average of the two higher protein 
levels. 


13.13 (a) The hypotheses are 


Ho : Pl — /l2 — h3 ~ h4) 

Hi : At least two of the means are not equal. 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Treatments 

1083.60 

3 

361.20 

13.50 

Error 

1177.68 

44 

26.77 


Total 

2261.28 

47 
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with P- value < 0.0001. 

Decision: Reject H 0 . The treatment means are different. 

(b) For testing two contrasts L\ — pi — /i 2 and L 2 — p 3 — /J4 at level a = 0.01, we 
have the following 


Contrast 

Sum of Squares 

Computed / 

P-value 

1 vs. 2 

785.47 

29.35 

< 0.0001 

3 vs. 4 

96.00 

3.59 

0.0648 


Hence, Bath I and Bath II were significantly different for 5 launderings, and Bath 
I and Bath II were not different for 10 launderings. 

13.14 The means of the treatments are: 

Vi. = 5.44,1/2. = 7.90, 2/3. = 4.30,1/4. = 2.98, and y 5 , = 6.96. 

Since </(0.05, 5, 20) = 4.24, the critical difference is (4.24) = 3.27. Therefore, 

the Tukey’s result may be summarized as follows: 

Vh. y-i. yi. Vs. 2 / 2 . 

2.98 4.30 5.44 6.96 7.90 


13.15 Since g(0.05,4, 16) = 4.05, the critical difference is (4.05) = 2.655. Hence 

y.3. y\. Ik V 2 . 

56.52 59.66 61.12 61.96 

13.16 (a) The hypotheses are 

Ho '■ Pi — p'2 = P3 = Pii 

Hi : At least two of the means are not equal. 


a = 0.05. 
Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Blends 

119.649 

3 

39.883 

7.10 

Error 

44.920 

8 

5.615 


Total 

164.569 

11 




with P-value= 0.0121. 

Decision: Reject Hq. There is a significant difference in mean yield reduction for 
the 4 preselected blends. 
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(b) Since \/s 2 /3 = 1.368 we get 


Therefore, 


p 

2 

3 

4 

r 'p 

3.261 

3.399 

3.475 

Rp 

4.46 

4.65 

4.75 


2/3. 2/i. 2/2. 2/4. 

23.23 25.93 26.17 31.90 


(c) Since g(0.05,4,8) = 4.53, the critical difference is 6.20. Hence 

2/3. 2/i. 2/2. 2/4. 

23.23 25.93 26.17 31.90 


13.17 (a) The hypotheses are 

Ho '■ hi = h'2 — ' ' ' = p 5 ) 

PA : At least two of the means are not equal. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Procedures 

7828.30 

4 

1957.08 

9.01 

Error 

3256.50 

15 

217.10 


Total 

11084.80 

19 




with P-value= 0.0006. 

Decision: Reject H 0 . There is a significant difference in the average species count 
for the different procedures. 

(b) Since g(0.05, 5, 15) = 4.373 and y _ 7 . 357 , the critical difference is 32.2. 
Hence 

2 Ik Vs Vsub 2 1m 2 Id 

12.50 24.25 26.50 55.50 64.25 


13.18 The hypotheses are 

Hq '■ lii = /12 = • • • = R5, 

H\ : At least two of the means are not equal. 


Computation: 



192 


Chapter 13 One-Factor Experiments: General 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Angles 

99.024 

4 

24.756 

21.40 

Error 

23.136 

20 

1.157 


Total 

122.160 

24 




with P-value< 0.0001. 

Decision: Reject Hq. There is a significant difference in mean pressure for the different 
angles. 

13.19 When we obtain the ANOVA table, we derive s 2 = 0.2174. Hence 

y/2s 2 /n = \J (2)(0.2174)/5 = 0.2949. 

The sample means for each treatment levels are 

y c = 6.88, yi. = 8.82, y 2 . = 8.16, y 3 . = 6.82, y 4 . = 6.14. 


Hence 


d i 

d 3 


8.82-6.88 

0.2949 

6.82-6.88 

0.2949 


6.579, 


-0.203, 


d 2 — 


c / 4 


8.16-6.88 

0.2949 “ 

6.14-6.88 

“ 0.2949 


4.340, 


= -2.509. 


From Table A. 14, we have ^ 0 . 025 ( 4 , 20) = 2.65. Therefore, concentrations 1 and 2 are 
significantly different from the control. 


13.20 The ANOVA table can be obtained as follows: 


Source of 

Sum of 

Degrees of 

Mean Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Cables 

1924.296 

8 

240.537 

9.07 

Error 

2626.917 

99 

26.535 


Total 

4551.213 

107 



with P-value< 0.0001. 





The results from Tukey’s procedure 

can be obtained as follows: 


V2. V3. 

yi. 

Va. Ve. 

2 / 7 . 2 / 5 . 

2 / 8 . 

-7.000 -6.083 

-4.083 

2.667 0.833 

0.917 1.917 

3.333 


The cables are significantly different: 
9 is different from 4, 1, 2, 3 
8 is different from 1, 3, 2 
5, 7, 6 are different from 3, 2. 
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13.21 Aggregate 4 has a significantly lower absorption rate than the other aggregates. 


13.22 (a) The hypotheses are 

Hq : He = hL = I^M = hH, 

H i : At least two of the means are not equal. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Finance Leverages 

80.7683 

3 

26.9228 

5.34 

Error 

100.8700 

20 

5.0435 


Total 

181.6383 

23 




with P-value= 0.0073. 

Decision: Reject H 0 . The means are not all equal for the different financial 
leverages. 

(b) \j2s 2 /n = \J (2)(5.0435)/6 = 1.2966. The sample means for each treatment levels 
are 

yc = 4.3833, fji = 5.1000, ijh = 8.3333, yM = 8.4167. 

Hence 


d'L 

d,L 


5.1000 -4.3833 
1.2966 

8.333-4.3833 
1.2966 " 


= 0.5528, 
3.0464. 


cIm — 


8.4167-4.3833 

1.2966 


3.1108, 


From Table A. 14, we have ^ 0 . 025 ( 3 , 20) = 2.54. Therefore, the mean rate of return 
are significantly higher for median and high financial leverage than for control. 


13.23 The ANOVA table can be obtained as follows: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Temperatures 

1268.5333 

4 

317.1333 

70.27 

Error 

112.8333 

25 

4.5133 


Total 

1381.3667 

29 




with P-value< 0.0001. 

The results from Tukey’s procedure can be obtained as follows: 

Vo 2/25 V 1 00 1/75 2/50 

55.167 60.167 64.167 70.500 72.833 
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The batteries activated at temperature 50 and 75 have significantly longer activated 
life. 

13.24 The Duncan’s procedure shows the following results: 

Ve Va Vc 

0.3300 0.9422 1.0063 


Hence, the sorption rate using the Esters is significantly lower than the sorption rate 
using the Aromatics or the Chloroalkanes. 


13.25 Based on the definition, we have the following. 


SSB = kJ 2 ( y , 

3 = 1 

13.26 From the model 

and the constraints 

we obtain 
Hence 


b / rj-i rj-i \ 2 


y ^ = k Y ,\ iF -^ 

3 = 1 


b rp 2 ry2 b ^2 

V -4 - 2 — + — = V -i- 

^ k bk bk ^ h 


3 = 1 


Vij — p + + f3j + Cjj, 

k b 

cti = 0 and (3j = 0, 


i=l 


t=i 


y.j = p + ( 3 j + e.j and y.. = p + e... 

b 


ssb = k Efc - v-f = * + o - f . 

1=1 1=1 

Since E(e,j ) = 0 and E(e„) = 0, we obtain 

b b 

E(SSB) = E ^) ~ bkE( e 2 


^ ^2 


1=1 


1=1 


We know that H(e 2 ) = ^ and H(e 2 ) = Then 


l=i 


E(SSB) =kJ2Pj+ h(j2 ~(? 2 = {b~ l)a 2 + k J2 
l=i l=i 


T 2 

6A 7 ' 


13.27 (a) The hypotheses are 

Ho : cti = 0:2 = «3 = oq = 0, fertilizer effects are zero 
Hi : At least one of the ccds is not equal to zero. 

a = 0.05. 

Critical region: / > 4.76. 

Computation: 
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Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Fertilizers 

218.1933 

3 

72.7311 

6.11 

Blocks 

197.6317 

2 

98.8158 


Error 

71.4017 

6 

11.9003 


Total 

487.2267 

11 




P-value= 0.0296. Decision: Reject H 0 . The means are not all equal, 
(b) The results of testing the contrasts are shown as: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

(/1, /a) vs (/ 2 . / 4 ) 

206.6700 

1 

206.6700 

17.37 

fi vs / 3 

11.4817 

1 

11.4817 

0.96 

Error 

71.4017 

6 

11.9003 



The corresponding P- values for the above contrast tests are 0.0059 and 0.3639, 
respectively. Hence, for the first contrast, the test is significant and the for the 
second contrast, the test is insignificant. 

13.28 The hypotheses are 

H 0 : aq = a 2 = 0:3 = 0, no differences in the varieties 
Hi : At least one of the cq’s is not equal to zero. 

a = 0.05. 

Critical region: / > 5.14. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Treatments 

24.500 

2 

12.250 

1.74 

Blocks 

171.333 

3 

57.111 


Error 

42.167 

6 

7.028 


Total 

238.000 

11 




P-value=0.2535. Decision: Do not reject II 0 : could not show that the varieties of 
potatoes differ in field. 

13.29 The hypotheses are 

H 0 : ai = a 2 = a 3 = 0, brand effects are zero 

Hi : At least one of the cq’s is not equal to zero. 


a = 0.05. 

Critical region: / > 3.84. 
Computation: 
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Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Treatments 

27.797 

2 

13.899 

5.99 

Blocks 

16.536 

4 

4.134 


Error 

18.556 

8 

2.320 


Total 

62.889 

14 




P-value=0.0257. Decision: Reject Hq] mean percent of foreign additives is not the 
same for all three brand of jam. The means are: 

Jam A: 2.36, Jam B: 3.48, Jam C: 5.64. 

Based on the means, Jam A appears to have the smallest amount of foreign additives. 

13.30 The hypotheses are 

H 0 : a i = o ?2 = «3 = «4 = 0, courses are equal difficulty 
H\ : At least one of the cq’s is not equal to zero. 

a = 0.05. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Subjects 

42.150 

3 

14.050 

0.15 

Students 

1618.700 

4 

404.675 


Error 

1112.100 

12 

92.675 


Total 

2772.950 

19 




P-value=0.9267. Decision: Fail to reject Hq ; there is no significant evidence to conclude 
that courses are of different difficulty. 

13.31 The hypotheses are 

Hq : cti = a 2 = • • • = «6 = 0, station effects are zero 
Hi : At least one of the cq’s is not equal to zero. 


a = 0 . 01 . 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Stations 

230.127 

5 

46.025 

26.14 

Dates 

3.259 

5 

0.652 


Error 

44.018 

25 

1.761 


Total 

277.405 

35 
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P-value< 0.0001. Decision: Reject Ho ; the mean concentration is different at the 
different stations. 

13.32 The hypotheses are 

Hq : ay = a 2 = a 3 = 0, station effects are zero 
Hi : At least one of the ads is not equal to zero. 

a = 0.05. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Stations 

10.115 

2 

5.057 

0.15 

Months 

537.030 

11 

48.821 


Error 

744.416 

22 

33.837 


Total 

1291.561 

35 




P -value— 0.8620. Decision: Do not reject Ho ; the treatment means do not differ 
significantly. 

13.33 The hypotheses are 

Ho : ai = a 2 = a 3 = 0, diet effects are zero 
Hi : At least one of the ccd s is not equal to zero. 

a = 0.01. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Diets 

4297.000 

2 

2148.500 

11.86 

Subjects 

6033.333 

5 

1206.667 


Error 

1811.667 

10 

181.167 


Total 

12142.000 

17 




P-value= 0.0023. Decision: Reject Ho ; differences among the diets are significant. 

13.34 The hypotheses are 

Po : «i = «2 = «3 = 0, analyst effects are zero 
Hi : At least one of the ads is not equal to zero. 


a = 0.05. 
Computation: 
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Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Analysts 

0.001400 

2 

0.000700 

3.00 

Individuals 

0.021225 

3 

0.007075 


Error 

0.001400 

6 

0.000233 


Total 

0.024025 

11 




P-value= 0.1250. Decision: Do not reject H 0 ', cannot show that the analysts differ 
significantly. 

13.35 The hypotheses are 

H 0 : a i = a 2 = a 3 = a 4 = a 5 = 0, treatment effects are zero 
Hi : At least one of the cq’ s is not equal to zero. 


a = 0 . 01 . 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Treatments 

79630.133 

4 

19907.533 

0.58 

Locations 

634334.667 

5 

126866.933 


Error 

689106.667 

20 

34455.333 


Total 

1403071.467 

29 




P-value= 0.6821. Decision: Do not reject H 0 ; the treatment means do not differ 
significantly. 

13.36 The hypotheses are 

H 0 : a± = 0:2 = «3 = 0, treatment effects are zero 
Hi : At least one of the cq’s is not equal to zero. 

a = 0 . 01 . 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Treatments 

203.2792 

2 

101.6396 

8.60 

Subjects 

188.2271 

9 

20.9141 


Error 

212.8042 

18 

11.8225 


Total 

604.3104 

29 




P-value= 0.0024. Decision: Reject Ho ; the mean weight losses are different for different 
treatments and the therapists had the greatest effect on the weight loss. 
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13.37 The total sums of squares can be written as 

Y Y Y^ k ~ y^ 2 YY Y^- ~ y-) + _ y-) + (£■* ~ y-) 

i j k i j k 

+ ( Uijk Vi., y.j. y..k T 2 y...)] 

=r Y(y*~ - y -) 2 + r Y(y-f ~ y-) 2 + r Y(y- k ~~ y-) 2 

i j k 

+ YY Y^ k - y>- ~ y-j- ~ + 2 &») 2 

i j k 

+ 6 cross-product terms, 
and all cross-product terms are equal to zeroes. 

13.38 For the model = p + cq + /3j + Tk + e^, we have 


y..k = V + n + e..fc, and y,„ = p + e... 
Hence SlSTr = r - y...) 2 = r ^(r fc + e.. fc - e...) 2 , and 


B(SSTr) = r V r 2 + r V S(<y) - r 2 V S(5 2 ) 

k k k 

= r Y T % + r Yy- r2 ^ = r Y T Z + 

k k k 

- l )° 2 + r Y T k- 


2 2 
rcr — er 


= r 


13.39 The hypotheses are 

77 0 : ri = r 2 = r 3 = r 4 = 0, professor effects are zero 
H\ : At least one of the rb s is not equal to zero. 


a = 0.05. 
Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Time Periods 

474.50 

3 

158.17 


Courses 

252.50 

3 

84.17 


Professors 

723.50 

3 

241.17 

5.03 

Error 

287.50 

6 

47.92 


Total 

1738.00 

15 




P-value= 0.0446. Decision: Reject H 0 ; grades are affected by different professors. 
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13.40 The hypotheses are 

Hq : ta = tb = Tc = t d = te = 0, color additive effects are zero 
Hi : At least one of the rys is not equal to zero. 


a = 0.05. 
Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Workers 

12.4344 

4 

3.1086 


Days 

14.7944 

4 

3.6986 


Additives 

3.9864 

4 

0.9966 

1.29 

Error 

9.2712 

12 

0.7726 


Total 

40.4864 

24 




P-value= 0.3280. Decision: Do not reject H 0 ; color additives could not be shown to 
have an effect on setting time. 

13.41 The hypotheses are 

H 0 : a± — a 2 — a 3 — 0, dye effects are zero 
H i : At least one of the cq’s is not equal to zero. 

a = 0.05. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Amounts 

1238.8825 

2 

619.4413 

122.37 

Plants 

53.7004 

1 

53.7004 


Error 

101.2433 

20 

5.0622 


Total 

1393.8263 

23 




P-value< 0.0001. Decision: Reject H 0 ] color densities of fabric differ significantly for 
three levels of dyes. 

13.42 (a) After a transformation g(y) = y/y, we come up with an ANOVA table as follows. 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Materials 

7.5123 

2 

3.7561 

16.20 

Error 

6.2616 

27 

0.2319 


Total 

13.7739 

29 
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(b) The P-value< 0.0001. Hence, there is significant difference in flaws among three 
materials. 

(c) A residual plot is given below and it does show some heterogeneity of the variance 
among three treatment levels. 


I 

1.0 


I 

1.5 


+ 

+ 

+ 

+ 

r~ 

3.0 


(d) The purpose of the transformation is to stabilize the variances. 

(e) One could be the distribution assumption itself. Once the data is transformed, it 
is not necessary that the data would follow a normal distribution. 

(f) Here the normal probability plot on residuals is shown. 


Theoretical Quantiles 


It appears to be close to a straight 
line. So, it is likely that the transformed data are normally distributed. 


13.43 (a) The hypotheses are 


Ho : o 2 a — 0 , 

Hi : o* a * 0 


a = 0.05. 
Computation: 
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Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Operators 

371.8719 

3 

123.9573 

14.91 

Error 

99.7925 

12 

8.3160 


Total 

471.6644 

15 




P-value= 0.0002. Decision: Reject H 0 ] operators are different, 
(b) <t 2 = 8.316 and d 2 = 12 3-9573-s.3i60 = 2 8.910. 

13.44 The model is y t -j = p + Ai + Bj + e ty Hence 

y.j = p + A, + Bj + e.j, and y„ = p + A, + P. + e... 


Therefore, 


and 


b 


As 

3 = 1 


E( 

j=i 


SSB = k Vfc - S ..) 2 = A- yp, - S ) + fe - t ,)] 2 


b b 

E(SSB) = kj^ E{B 2 ) - kbE(B 2 ) + k E(e 2 j) - kbE(e 2 ) 

j = 1 i =1 

= kba 2 p — ka 2 p + ba 2 — a 2 = (b — l)a 2 + k(b — l)cr|. 

13.45 (a) The hypotheses are 

Ho '■ = 0 , 

H.-.al^O 

a = 0.05. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Treatments 

23.238 

3 

7.746 

3.33 

Blocks 

45.283 

4 

11.321 


Error 

27.937 

12 

2.328 


Total 

96.458 

19 




P-value= 0.0565. Decision: Not able to show a significant difference in the ran- 
dom treatments at 0.05 level, although the P-value shows marginal significance. 

( b ) ^ = 7 ' 746 - 2 - 328 = 1.084, and a 2 = 1L321 ~ 2 - 328 = 2.248. 

13.46 From the model 

Vijk — y + Ai + Bj + Pfc + €ij, 
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13.47 


we have 


Hence, 


and 


y..k — y 4” A + B + T/j + and y_ — p + A. + B + T + e... 


SSTr = r 


k - y.J = r 

k k 


Eiw - t) + ( ? ~* - oi 2 > 


E(SSTr) = r E E (Th - + r y E Cy - r 2 E(t: 


k 

= r 2 a : 


(a) The matrix is 


A = 



ra. 2 

+ ra 2 — 

a 2 

— 

V — 

1)(* 2 4 

pfc 

b 

b 

b 

k 

k 

••• k~\ 

6 

b 

0 ••• 

0 

1 

1 

... 1 

6 

0 

b 

0 

1 

1 

. . . 1 

6 

0 

0 ••• 

b 

1 

1 

. . . 1 

k 

1 

1 ••• 

k 

0 

0 

... 0 

k 

1 

1 ••• 

0 

k 

0 

... 0 

k 

1 

1 ••• 

0 

0 

0 

••• k 


where h = number of blocks and k = number of treatments. The vectors are 

b' = (p, an, ol2i ■ ■ ■ , oi k , Pi, P2, , Pb)', and 

g = (T.,Ti.,T 2 ., ■ ■ ■ , 7 7.1. 7.2. • • • ,T b ) . 

k b 

(b) Solving the system Ab = g with the constraints a i = 0 and Pj = 0 > we 

' 1 j = 1 

have 


Therefore, 


h = y 

6ii = yi.-y.., for i = 1,2, . . . , k, 

Pj = y.j-y for j = 1,2, ...,b. 


T 2 

R{oi 1 , a 2 , • • • , a*, /3i, P2, ■ ■ ■ , Pb) =b'g4g 


T. 


2 6 T 2 t 2 


y - y —i _ 2—. 

^ t ^ k bk 

1=1 j = 1 
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To find @ 2 , • • • , A> | ai, « 2 , • • • , Oik) we first find R(a i, « 2 , • • • , oik)- Setting 
/ 3j = 0 in the model, we obtain the estimates (after applying the constraint 

k 

E «* = o) 

2= 1 

p = y.., and &i = tji, — y__, for i = 1, 2, . . . , k. 

The g vector is the same as in part (a) with the exception that T i, T. 2 , , .. . , Tj, do 
not appear. Thns one obtains 

k rp 2 rp2 
R(oi\i Ol2i • • • i Oik) ^ ~' J i ^ ^ 

2= 1 

and thus 

R(Ph An • • ■ , A> I «i, « 2 , • • • , oi k ) = R{o i, a 2 , • • • , a*, An /3 2 , — , A>) 

b p'2 rp2 

- R(a 1 , a 2 , • • • , Ok) = -jr - ^7 = SS-B. 

t=i A ^ ' 


13.48 Since 


1-/? 


P 


P(3, 12) > 


3.49 

1 + (4) (1-5) 


P[F( 12,3) < 2.006] < 0.95. 


Hence we do not have large enough samples. We then find, by trial and error, that 
n — 16 is sufficient since 


1 ~P 


P 


P(3, 60) > 


2.76 

1 + (16)(1.5) 


P[P(60, 3) < 9.07] > 0.95. 


4 2 4 2 

13.49 We know </> 2 = b E ^ = §, when E ^ = 2.0. 

2=1 2=1 

If b = 10, (f) = 2.24; v\ = 3 and V 2 = 27 degrees of freedom. 
If b = 9, (f) = 2.12; Vi = 3 and v 2 = 24 degrees of freedom. 

If b = 8, (f) = 2.00; i>i = 3 and v 2 = 21 degrees of freedom. 
From Table A. 16 we see that 6 = 9 gives the desired result. 


13.50 For the randomized complete block design we have 


E(Sl) = E 



Therefore, 


_ th [£(£?)] _ vi 

2a 2 2 


(k~ 1) 


o- 2 + 6 E «?/(* - 1) 


2=1 


2a 2 


k — 1 



2=1 


2 
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and then 


E(S 2 ) - O' 2 


cr z 


Vi + 1 


[a 2 + b£a?/(k 

i= 1 

a 2 


i)] 


a~ 


k-l 


%— 1 


a t 

ka 2 


13.51 (a) The model is y lJ = p + cq + where cq ~ n(0, a 2 ). 

(b) Since s 2 = 0.02056 and s 2 = 0.01791, we have a 2 = 0.02056 and s ' 10 s = 

0 . 01191 -^ 0.02056 _ —0.00027, which implies d 2 = 0. 

13.52 (a) The P-valne of the test result is 0.7830. Hence, the variance component of pour 

is not significantly different from 0. 

(b) We have the ANOVA table as follows: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Pours 

0.08906 

4 

0.02227 

0.43 

Error 

1.02788 

20 

0.05139 


Total 

1.11694 

24 




Since ^ = 0-02227-0.05139 < 0> we have (j 2 = 

13.53 (a) yij = p + ati + e^-, where a* ~ n{x\ 0, cr 2 ). 

(b) Running an ANOVA analysis, we obtain the P-value as 0.0121. Hence, the loom 
variance component is significantly different from 0 at level 0.05. 

(c) The suspicion is supported by the data. 

13.54 The hypotheses are 


Ho : Pi — P2 — R3 — l- l 4 , 

Hi : At least two of the are not equal. 

a = 0.05. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Garlon levels 

3.7289 

3 

1.2430 

1.57 

Error 

9.5213 

12 

0.7934 


Total 

13.2502 

15 




P -value— 0.2487. Decision: Do not reject Hq] there is insufficient evidence to claim 
that the concentration levels of Garlon would impact the heights of shoots. 
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13.55 Bartlett’s statistic is b = 0.8254. Conclusion: do not reject homogeneous variance 
assumption. 

13.56 The hypotheses are 

Hq : ta = tb = Tc = td = te = 0, ration effects are zero 
Hi : At least one of the r*’ s is not zero. 


a = 0 . 01 . 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Lactation Periods 

245.8224 

4 

61.4556 


Cows 

353.1224 

4 

88.2806 


Rations 

89.2624 

4 

22.3156 

12.55 

Error 

21.3392 

12 

1.7783 


Total 

709.5464 

24 




P-value= 0.0003. Decision: Reject H 0 ; different rations have an effect on the daily 
milk production. 


13.57 It can be shown that yc = 76.16, y\ = 81.20, y 2 = 81.54 and y 3 = 80.98. Since this is 
a one-sided test, we find d 0 .oi(3, 16) = 3.05 and 



^(2)(3.52575) =llg756 


Hence 


d\ = 


81.20 - 76.16 
1.18756 


4.244, d 2 


81.54- 76.16 
1.18756 


4.532, d 3 


80.98 - 76.16 
1.18756 


4.059, 


which are all larger than the critical value. Hence, significantly higher yields are 
obtained with the catalysts than with no catalyst. 


13.58 (a) The hypotheses for the Bartlett’s test are 

TJ 2 2 2 2 

H 0 : a A = a B = a c = a D , 

H\ : The variances are not all equal. 


a = 0.05. 

Critical region: We have ri\ = n 2 = n 3 — n 4 = 5, N — 20, and k — 4. Therefore, 
we reject H 0 when b < 5 4 (0.05,5) = 0.5850. 

Computation: sa = 1.40819, sb = 2.16056, sc = 1.16276, sd = 0.76942 and 
hence s p = 1.46586. From these, we can obtain that b = 0.7678. 

Decision: Do not reject Hq] there is no sufficient evidence to conclude that the 
variances are not equal. 
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(b) The hypotheses are 

Ho : Pa = hs = he = ho, 

Hi : At least two of the /Vs are not equal. 

a = 0.05. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Laboratories 

85.9255 

3 

28.6418 

13.33 

Error 

34.3800 

16 

2.1488 


Total 

120.3055 

19 




P-value= 0.0001. Decision: Reject 1 1 o', the laboratory means are significantly 
different. 

(c) The normal probability plot is given as follows: 



- 2-10 1 2 


Theoretical Quantiles 


13.59 The hypotheses for the Bartlett’s test are 

Ho . oq — cr 2 — cr 3 — <v 4 , 

Hi : The variances are not all equal. 

a = 0 . 01 . 

Critical region: We have ri\ = 712 = n 3 = 4, 714 = 9 N — 21, and k — 4. Therefore, we 
reject Hq when 

b < 64(0.01,4,4,4,9) 

= (4) (0.3475) + (4) (0.3475) + (4) (0.3475) + (9) (0.6892) = Q 4g39 

Computation: = 0.41709, = 0.93857, = 0.25673, s\ = 1.72451 and hence 

Sp = 1.0962. Therefore, 

_ [(0.41709) 3 (0.93857) 3 (0.25763) 3 (1.72451) 8 ] 1/17 

0 — — 0 . 79 . 

1.0962 
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Decision: Do not reject Ho] the variances are not significantly different. 

13.60 The hypotheses for the Cochran’s test are 

H-0 ■ <j A — a B — a Ci 

H\ : The variances are not all equal. 


a = 0 . 01 . 

Critical region: g > 0.6912. 

Computation: s\ = 29.5667, s 2 B = 10.8889, Sq = 3.2111, and hence = 43.6667. 
Now, g = iff = 0.6771. 

Decision: Do not reject H 0 ] the variances are not significantly different. 

13.61 The hypotheses for the Bartlett’s test are 

u ■ H H H 

H\ : The variances are not all equal. 


a = 0.05. 

Critical region: reject Ho when 


b < 6 4 ( 0.05,9,8,15) 


(9)(0.7686) + (8)(0.7387) + (15)(0.8632) 
32 


0.8055. 


Computation: b = = 0 7822 

Decision: Reject H 0 ] the variances are significantly different. 

13.62 (a) The hypotheses are 

Hq : o :\ = = 0:3 — ol 4 — 0, 

Hi : At least one of the ccj’s is not zero. 

a = 0.05. 

Computation: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Diets 

822.1360 

3 

274.0453 

38.65 

Blocks 

17.1038 

5 

3.4208 


Error 

106.3597 

15 

7.0906 


Total 

945.5995 

23 




with P-value< 0.0001. 

Decision: Reject H 0 ] diets do have a significant effect on mean percent dry matter. 
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(b) We know that yc = 35.8483, yF = 36.4217, ijt = 45.1833, yA = 49.6250, and 


Hence, 



(2)(7.0906) 

6 


1.5374. 


d 


Ammonia 


d 


Urea Feeding 


d 


Urea Treated 


49.6250 - 35.8483 
1.5374 

36.4217-35.8483 

1.5374 

45.1833 - 35.8483 
1.5374 


8.961, 

0.3730, 

6.0719. 


Using the critical value d 0 .05(3, 15) = 2.61, we obtain that only “Urea Feeding” is 
not significantly different from the control, at level 0.05. 

(c) The normal probability plot for the residuals are given below. 


■■t 

a> cm 

c 
03 
Z3 

o 

«D _ 

Q. ° 

E 

03 

c n 

CM 
■■t 

- 2-10 1 2 
Theoretical Quantiles 



13.63 The hypotheses are 

H 0 : aq = oq = 03 = 0, 

Hi : At least one of the oii s is not zero. 

ct = 0.05. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Diet 

0.32356 

2 

0.16178 

9.33 

Error 

0.20808 

12 

0.01734 


Total 

0.53164 

14 
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with P-value= 0.0036. 

Decision: Reject H 0 ; zinc is significantly different among the diets. 

13.64 (a) The gasoline manufacturers would want to apply their results to more than one 

model of car. 

(b) Yes, there is a significant difference in the miles per gallon for the three brands 
of gasoline. 

(c) I would choose brand C for the best miles per gallon. 

13.65 (a) The process would include more than one stamping machine and the results might 

differ with different machines. 

(b) The mean plot is shown below. 



cork plastic rubber 

Material 


(c) Material 1 appears to be the best. 

(d) Yes, there is interaction. Materials 1 and 3 have better results with machine 1 
but material 2 has better results with machine 2. 

13.66 (a) The hypotheses are 

Hq : oy = 0*2 — = 0, 

H i : At least one of the cq’s is not zero. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Paint Types 

227875.11 

2 

113937.57 

5.08 

Error 

336361.83 

15 

22424.12 


Total 

564236.94 

17 




with P-value= 0.0207. 

Decision: Reject H 0 at level 0.05; the average wearing quality differs significantly 
for three paints. 

(b) Using Tukey’s test, it turns out the following. 
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V\. V3. V2. 

197.83 419.50 450.50 


Types 2 and 3 are not significantly different, while Type 1 is significantly different 
from Type 2. 

(c) We plot the residual plot and the normal probability plot for the residuals as 
follows. 



1.0 1.5 2.0 2.5 3.0 -2 -1 0 1 2 

Type Theoretical Quantiles 


It appears that the heterogeneity in variances may be violated, as is the normality 
assumption. 

(d) We do a log transformation of the data, i.e., y = log(y). The ANOVA result has 
changed as follows. 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Paint Types 

2.6308 

2 

1.3154 

6.07 

Error 

3.2516 

15 

0.2168 


Total 

5.8824 

17 




with P-value= 0.0117. 

Decision: Reject H 0 at level 0.05; the average wearing quality differ significantly 
for three paints. The residual and normal probability plots are shown here: 



1.0 1.5 2.0 2.5 3.0 -2 -1 0 1 2 

Type Theoretical Quantiles 


While the homogeneity of the variances seem to be a little better, the normality 
assumption may still be invalid. 
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13.67 (a) The hypotheses are 

Hq : Ckq — OL 2 — OL 3 — OL 4 — 0 , 

H\ : At least one of the cq’s is not zero. 

Computation: 


Source of 

Sum of 

Degrees of 

Mean 

Computed 

Variation 

Squares 

Freedom 

Square 

/ 

Locations 

0.01594 

3 

0.00531 

13.80 

Error 

0.00616 

16 

0.00039 


Total 

0.02210 

19 




with P-value= 0.0001. 

Decision: Reject Hq\ the mean ozone levels differ significantly across the locations, 
(b) Using Tukey’s test, the results are as follows. 

IJ4. yi. y.3, fh. 

0.078 0.092 0.096 0.152 

Location 2 appears to have much higher ozone measurements than other locations. 
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Factorial Experiments (Two or More 
Factors) 

14.1 The hypotheses of the three parts are, 

(a) for the main effects temperature, 

H 0 : ai = a 2 = a 3 = 0, 

H 1 : At least one of the a^’s is not zero; 

(b) for the main effects ovens, 

H'o : A = fo = fa = fa = 0, 

H 1 : At least one of the /3;’s is not zero; 

(c) and for the interactions, 

Hq : (a/3) ii = (a/3) 12 = • • • = (a/3) 34 = 0, 

H l : At least one of the (a/3)^’ s is not zero. 


a = 0.05. 

Critical regions: (a) /i > 3.00; (b) fa > 3.89; and (c) / 3 > 3.49. 
Computations: From the computer printout we have 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Temperatures 

5194.08 

2 

2597.0400 

8.13 

Ovens 

4963.12 

3 

1654.3733 

5.18 

Interaction 

3126.26 

6 

521.0433 

1.63 

Error 

3833.50 

12 

319.4583 


Total 

17,116.96 

23 
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Decision: (a) Reject H 0 ; (b) Reject H g ; (c) Do not reject Hq . 

14.2 The hypotheses of the three parts are, 

(a) for the main effects brands, 

Hq : tti = 0:2 = «3 = 0, 

H 1 : At least one of the cq’ s is not zero; 

(b) for the main effects times, 

H 0 : fly — fJ‘2 — fa — 0, 

H l : At least one of the /3j’s is not zero; 

(c) and for the interactions, 

H 0 : (a/3) ii = (a/3) is = • • • = (0/3)33 — 0, 

H l : At least one of the (a/3)j/ s is not zero. 


o = 0.05. 

Critical regions: (a) f\ > 3.35; (b) / 2 > 3.35; and (c) f 3 > 2.73. 
Computations: From the computer printout we have 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Brands 

32.7517 

2 

16.3758 

1.74 

Times 

227.2117 

2 

113.6058 

12.04 

Interaction 

17.3217 

4 

4.3304 

0.46 

Error 

254.7025 

27 

9.4334 


Total 

531.9875 

35 




Decision: (a) Do not reject H' 0 ] (b) Reject Hq ; (c) Do not reject Hq . 
14.3 The hypotheses of the three parts are, 

(a) for the main effects environments, 

H' 0 : Oi = 02 = 0, (no differences in the environment) 
H 1 : At least one of the Oj’s is not zero; 

(b) for the main effects strains, 

H 0 ’■ Pi — fh = As — 0, (no differences in the strains) 
H l : At least one of the /3j’s is not zero; 
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(c) and for the interactions, 

H 0 : (a/3) 11 = (a/3) 12 = • • • = (a/3 ) 2 3 = 0, (environments and strains do not interact) 
H 1 : At least one of the (a/3)j/ s is not zero, 
a = 0 . 01 . 

Critical regions: (a) f\ > 7.29; (b) f 2 > 5.16; and (c) / 3 > 5.16. 

Computations: From the computer printout we have 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Environments 

14,875.521 

1 

14,875.521 

14.81 

Strains 

18,154.167 

2 

9,077.083 

9.04 

Interaction 

1,235.167 

2 

617.583 

0.61 

Error 

42, 192.625 

42 

1004.586 


Total 

76,457.479 

47 




Decision: (a) Reject H' {) \ (b) Reject H"p. (c) Do not reject Hq . Interaction is not 
significant, while both main effects, environment and strain, are all significant. 

14.4 (a) The hypotheses of the three parts are, 

Hq : Qu — g ?2 = a 3 — 0 

H 1 : At least one of the ctj’s is not zero; 

H'o :{h = fo = fo = 0 , 

H 1 : At least one of the /3j’s is not zero; 

H 0 : (a/3) 11 = (a/3) 12 = • • • = (a/3 ) 33 = 0, 

H 1 : At least one of the (a/3)y’ s is not zero. 


a = 0 . 01 . 

Critical regions: for H' 0 , f\ > 3.21; for Hq, f 2 > 3.21; and for Hq , f 3 > 2.59. 
Computations: From the computer printout we have 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Coating 

1,535,021.37 

2 

767,510.69 

6.87 

Humidity 

1,020,639.15 

2 

510,319.57 

4.57 

Interaction 

1,089,989.63 

4 

272,497.41 

2.44 

Error 

5,028,396.67 

45 

111,742.15 


Total 

76,457.479 

47 




Decision: Reject H' 0 ; Reject Do not reject Hq . Coating and humidity do not 
interact, while both main effects are all significant. 
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(b) The three means for the humidity are ijl = 733.78, yM = 406.39 and ijh = 638.39. 
Using Duncan’s test, the means can be grouped as 

IJm Vl IJh 
406.39 638.39 733.78 

Therefore, corrosion damage is different for medium humidity than for low or high 
humidity. 

14.5 The hypotheses of the three parts are, 

(a) for the main effects subjects, 

H 0 : oil = 012 = CI3 = 0, 

H 1 : At least one of the ad s is not zero; 

(b) for the main effects muscles, 

H 0 : /3i = / 3 2 = fiz = /?4 = /?5 = 0 , 

H l : At least one of the /3j’s is not zero; 

(c) and for the interactions, 

H 0 : (a/3) 11 = (a/3) 12 = • • • = ( 0 / 3)35 = 0, 

H 1 : At least one of the (a/3)j/s is not zero. 

a = 0 . 01 . 

Critical regions: (a) /j > 5.39; (b) / 2 > 4.02; and (c) / 3 > 3.17. 

Computations: From the computer printout we have 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

Subjects 

4,814.74 

2 

2,407.37 

34.40 

Muscles 

7, 543.87 

4 

1,885.97 

26.95 

Interaction 

11,362.20 

8 

1,420.28 

20.30 

Error 

2,099.17 

30 

69.97 


Total 

25,819.98 

44 




Decision: (a) Reject Ff' 0 ; (b) Reject H ((; (c) Reject Hq . 
14.6 The AN OVA table is shown as 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Additive 

1.7578 

1 

1.7578 

22.29 

< 0.0001 

Temperature 

0.8059 

3 

0.2686 

3.41 

0.0338 

Interaction 

1.7934 

3 

0.5978 

7.58 

0.0010 

Error 

1.8925 

24 

0.0789 



Total 

6.2497 

32 
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Decision: All main effects and interaction are significant. 
An interaction plot is given here. 
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14.7 The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Temperature 

430.475 

3 

143.492 

10.85 

0.0003 

Catalyst 

2,466.650 

4 

616.663 

46.63 

< 0.0001 

Interaction 

326.150 

12 

27.179 

2.06 

0.0745 

Error 

264.500 

20 

13.225 



Total 

3,487.775 

39 





Decision: All main effects are significant and the interaction is significant at level 
0.0745. Hence, if 0.05 significance level is used, interaction is not significant. 

An interaction plot is given here. 



0.5 0.6 0.7 0.8 0.9 


Amount of Catalyst 


Duncan’s tests, at level 0.05, for both main effects result in the following, 
(a) For Temperature: 
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(b) For Amount of Catalyst: 

2 / 0.5 2 / 0.6 2 / 0.9 2 / 0.7 2 / 0.8 

44.125 56.000 58.125 64.625 66.250 


14.8 (a) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Nickel 

31,250.00 

1 

31,250.00 

44.52 

< 0.0001 

pH 

6,606.33 

2 

3,303.17 

4.71 

0.0310 

Nickel* pH 

670.33 

2 

335.17 

0.48 

0.6316 

Error 

8,423.33 

12 

701.94 



Total 

3,487.775 

39 





Decision: Nickel contents and levels of pH do not interact to each other, while 
both main effects of nickel contents and levels of pH are all significant, at level 
higher than 0.0310. 

(b) In comparing the means of the six treatment combinations, a nickel content level 
of 18 and a pH level of 5 resulted in the largest values of thickness. 

(c) The interaction plot is given here and it shows no apparent interactions. 



pH 


14.9 (a) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Tool 

675.00 

1 

675.00 

74.31 

< 0.0001 

Speed 

12.00 

1 

12.00 

1.32 

0.2836 

TooFSpeed 

192.00 

1 

192.00 

21.14 

0.0018 

Error 

72.67 

8 

9.08 



Total 

951.67 

11 





Decision: The interaction effects are significant. Although the main effects of 
speed showed insignificance, we might not make such a conclusion since its effects 
might be masked by significant interaction. 
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(b) In the graph shown, we claim that the cutting speed that results in the longest 
life of the machine tool depends on the tool geometry, although the variability of 
the life is greater with tool geometry at level 1. 



i 


2 


Tool Geometry 


(c) Since interaction effects are significant, we do the analysis of variance for separate 
tool geometry. 

(i) For tool geometry 1, an /-test for the cutting speed resulted in a P-value = 
0.0405 with the mean life (standard deviation) of the machine tool at 33.33 
(4.04) for high speed and 23.33 (4.16) for low speed. Hence, a high cutting 
speed has longer life for tool geometry 1. 

(ii) For tool geometry 2, an /-test for the cutting speed resulted in a P-value = 
0.0031 with the mean life (standard deviation) of the machine tool at 10.33 
(0.58) for high speed and 16.33 (1.53) for low speed. Hence, a low cutting 
speed has longer life for tool geometry 2. 

For the above detailed analysis, we note that the standard deviations for the mean 
life are much higher at tool geometry 1. 

(d) See part (b). 


14.10 (a) y ijk = p + a* + (3j + (a/?)„ + e ijk , i = 1, 2, . . . , a; / = 1, 2, . . . , 6; k = 1, 2, . . . , n. 

(b) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Dose 

117.9267 

1 

117.9267 

18.08 

0.0511 

Position 

15.0633 

2 

7.5317 

1.15 

0.4641 

Error 

13.0433 

2 

6.5217 



Total 

146.0333 

5 





(c) (n — 1) — (a — 1) — (b — 1) = 5 — 1 — 2 = 2. 

(d) At level 0.05, Tukey’s result for the furnace position is shown here: 


V2 Vi V3 

19.850 21.350 23.700 
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Although Tukey’s multiple comparisons resulted in insignificant differences among 
the furnace position levels, based on a P-value of 0.0511 for the Dose and on the 
plot shown we can see that Dose=2 results in higher resistivity. 
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14.11 (a) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Method 

0.000104 

1 

0.000104 

6.57 

0.0226 

Lab 

0.008058 

6 

0.001343 

84.70 

< 0.0001 

Method*Lab 

0.000198 

6 

0.000033 

2.08 

0.1215 

Error 

0.000222 

14 

0.000016 



Total 

0.00858243 

27 





(b) Since the P-valne = 0.1215 for the interaction, the interaction is not significant. 
Hence, the results on the main effects can be considered meaningful to the scien- 
tist. 

(c) Both main effects, method of analysis and laboratory, are all significant. 

(d) The interaction plot is show here. 


d 


o 
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1 2 3 4 5 6 7 


Lab 



When the tests are done separately, i.e., we only use the data for Lab 1, or Lab 
2 alone, the P-values for testing the differences of the methods at Lab 1 and 7 
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are 0.8600 and 0.1557, respectively. In this case, usually the degrees of freedom 
of errors are small. If we compare the mean differences of the method within the 
overall AN OVA model, we obtain the P- values for testing the differences of the 
methods at Lab 1 and 7 as 0.9010 and 0.0093, respectively. Hence, methods are 
no difference in Lab 1 and are significantly different in Lab 7. Similar results may 
be found in the interaction plot in (d). 


14.12 


(a) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Time 

0.025622 

2 

0.012811 

38.87 

< 0.0001 

Copper 

0.008956 

2 

0.004478 

13.58 

0.0003 

Time*Copper 

0.012756 

4 

0.003189 

9.67 

0.0002 

Error 

0.005933 

18 

0.000330 



Total 

0.053267 

26 





(b) The P-value < 0.0001. There is a significant time effect. 

(c) The P-value = 0.0003. There is a significant copper effect. 

(d) The interaction effect is significant since the P-value = 0.0002. The interaction 
plot is show here. 



i 


3 


Copper Content 


The algae concentrations for the various copper contents are all clearly influenced 
by the time effect shown in the graph. 


14.13 (a) The interaction plot is show here. There seems no interaction effect. 



1 2 


Treatment 
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(b) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Treatment 

0.060208 

1 

0.060208 

157.07 

< 0.0001 

Time 

0.060208 

1 

0.060208 

157.07 

< 0.0001 

Treat rnent * Time 

0.000008 

1 

0.000008 

0.02 

0.8864 

Error 

0.003067 

8 

0.000383 



Total 

0.123492 

11 





(c) The magnesium uptake are lower using treatment 2 than using treatment 1, no 
matter what the times are. Also, time 2 has lower magnesium uptake than time 
1. All the main effects are significant. 

(d) Using the regression model and making “Treatment” as categorical, we have the 
following fitted model: 

y = 2.4433 — 0.13667 Treatment — 0.13667 Time — 0.00333Treatment x Time. 


(e) The P-value of the interaction for the above regression model is 0.8864 and hence 
it is insignificant. 

14.14 (a) A natural linear model with interaction would be 

V — Pi 0 + Pl^l + P2X2 + Pl2%l%2- 

The fitted model would be 

y = 0.41772 - 0.06631xi - 0.00866x 2 + 0.00416xix 2 , 

with the P-values of the f-tests on each of the coefficients as 0.0092, 0.0379 and 
0.0318 for xi, x 2 , and XiX 2 , respectively. They are all significant at a level larger 
than 0.0379. Furthermore, K 2 ad - = 0.1788. 

(b) The new fitted model is 

y = 0.3368 - 0.15965a;i + 0.02684x 2 + 0.00416xix 2 + 0.02333x^ - 0.00155:4 

with P-values of the t-tests on each of the coefficients as 0.0004, < 0.0001, 0.0003, 
0.0156, and < 0.0001 for xi, x 2 . X\X 2i xf, and x|, respectively. Furthermore, 
P^dj = 0.7700 which is much higher than that of the model in (a). Model in (b) 
would be more appropriate. 


14.15 The ANOVA table is given here. 
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Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Main effect 
A 

2.24074 

1 

2.24074 

0.54 

0.4652 

B 

56.31815 

2 

28.15907 

6.85 

0.0030 

C 

17.65148 

2 

8.82574 

3.83 

0.1316 

Two- fact or 
Interaction 
AB 

31.47148 

2 

15.73574 

3.83 

0.0311 

AC 

31.20259 

2 

15.60130 

3.79 

0.0320 

BC 

2156074 

4 

5.39019 

1.31 

0.2845 

Three-factor 

Interaction 

ABC 

26.79852 

4 

6.69963 

1.63 

0.1881 

Error 

148.04000 

36 

4.11221 



Total 

335.28370 

53 





(a) Based on the P- values, only AB and AC interactions are significant. 

(b) The main effect B is significant. However, due to significant interactions men- 
tioned in (a), the insignificance of A and C cannot be counted. 

(c) Look at the interaction plot of the mean responses versus C for different cases of 
A. 



1 2 3 

C 


Apparently, the mean responses at different levels of C varies in different patterns 
for the different levels of A. Hence, although the overall test on factor C is 
insignificant, it is misleading since the significance of the effect C is masked by 
the significant interaction between A and C. 


14.16 (a) When only A, B, C , and BC factors are in the model, the P - value for BC 
interaction is 0.0806. Hence at level of 0.05, the interaction is insignificant. 

(b) When the sum of squares of the BC term is pooled with the sum of squares of 
the error, we increase the degrees of freedom of the error term. The P-values of 
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the main effects of A, B , and C are 0.0275, 0.0224, and 0.0131, respectively. All 
these are significant. 

14.17 Letting A, B , and C designate coating, humidity, and stress, respectively, the ANOVA 
table is given here. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Main effect 
A 

216,384.1 

1 

216,384.1 

0.05 

0.8299 

B 

19,876,891.0 

2 

9,938,445.5 

2.13 

0.1257 

C 

427,993,946.4 

2 

213,996,973.2 

45.96 

< 0.0001 

Two-factor 

Interaction 

AB 

31,736,625 

2 

15,868,312.9 

3.41 

0.0385 

AC 

699,830.1 

2 

349,915.0 

0.08 

0.9277 

BC 

58,623,693.2 

4 

13,655,923.3 

3.15 

0.0192 

Three-factor 

Interaction 

ABC 

36,034,808.9 

4 

9,008,702.2 

1.93 

0.1138 

Error 

335,213,133.6 

72 

4,655,738.0 



Total 

910,395,313.1 

89 





(a) The Coating and Humidity interaction, and the Humidity and Stress interaction 
have the P-values of 0.0385 and 0.0192, respectively. Hence, they are all signif- 
icant. On the other hand, the Stress main effect is strongly significant as well. 
However, both other main effects, Coating and Humidity, cannot be claimed as 
insignificant, since they are all part of the two significant interactions. 

(b) A Stress level of 20 consistently produces low fatigue. It appears to work best 
with medium humidity and an uncoated surface. 

14.18 The ANOVA table is given here. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

1.90591 

3 

0.63530 

8.28 

0.0003 

B 

0.02210 

1 

0.02212 

0.29 

0.5951 

C 

38.93402 

1 

38.93402 

507.57 

< 0.0001 

AB 

0.88632 

3 

0.29544 

3.85 

0.0185 

AC 

0.53594 

3 

0.17865 

2.33 

0.0931 

BC 

0.45435 

1 

0.45435 

5.92 

0.0207 

ABC 

0.42421 

3 

0.14140 

1.84 

0.1592 

Error 

2.45460 

32 

0.07671 



Total 

45.61745 

47 
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(a) Two-way interactions of AB and BC are all significant and main effect of A and 
C are all significant. The insignificance of the main effect B may not be valid due 
to the significant BC interaction. 

(b) Based on the P-values, Duncan’s tests and the interaction means, the most im- 
portant factor is C and using C = 2 is the most important way to increase percent 
weight. Also, using factor A at level 1 is the best. 

14.19 The AN OVA table shows: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

0.16617 

2 

0.08308 

14.22 

< 0.0001 

B 

0.07825 

2 

0.03913 

6.70 

0.0020 

C 

0.01947 

2 

0.00973 

1.67 

0.1954 

AB 

0.12845 

4 

0.03211 

5.50 

0.0006 

AC 

0.06280 

4 

0.01570 

2.69 

0.0369 

BC 

0.12644 

4 

0.03161 

5.41 

0.0007 

ABC 

0.14224 

8 

0.01765 

3.02 

0.0051 

Error 

0.47323 

81 

0.00584 



Total 

1.19603 

107 





There is a significant three-way interaction by Temperature, Surface, and Hrc. A plot 
of each Temperature is given to illustrate the interaction 



Hrc 
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14.20 (a) y ijk = /i + a { + (3j + y*, + {Pi)jk + = 0, ^7 fc = 0, Y^{Pi)jk = 0, 

3 k j 

YXPl )jk = 0, and e ijk ~ n(x\ 0, cr 2 ). 

k 

(b) The P-value of the Method and Type of Gold interaction is 0.10587. Hence, the 
interaction is at least marginally significant. 

(c) The best method depends on the type of gold used. 

The tests of the method effect for different type of gold yields the P-values as 
0.9801 and 0.0099 for “Gold Foil” and “Goldent”, respectively. Hence, the meth- 
ods are significantly different for the “Goldent” type. 

Here is an interaction plot. 



1 2 3 

Method 


It appears that when Type is “Goldent” and Method is 1, it yields the best 
hardness. 

14.21 (a) Yes, the P-values for Brand * Type and Brand * Temp are both < 0.0001. 

(b) The main effect of Brand has a P-value < 0.0001. So, three brands averaged 
across the other two factore are significantly different. 

(c) Using brand Y, powdered detergent and hot water yields the highest percent 
removal of dirt. 

14.22 (a) Define A, P, and C as “Powder Temperature,” “Die Temperature,” and “Extru- 

sion Rate,” respectively. The AN OVA table shows: 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

78.125 

1 

78.125 

125.00 

0.0079 

B 

3570.125 

1 

3570.125 

5712.20 

0.0002 

C 

2211.125 

1 

2211.125 

3537.80 

0.0003 

AB 

0.125 

1 

0.125 

0.20 

0.6985 

AC 

1.125 

1 

1.125 

1.80 

0.3118 

Error 

1.25 

2 

0.625 



Total 

5861.875 

7 
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The ANOVA results only show that the main effects are all significant and no 
two-way interactions are significant. 

(b) The interaction plots are shown here. 



Powder Temperature Powder Temperature 


(c) The interaction plots in part (b) are consistent with the findings in part (a) that 
no two-way interactions present. 

14.23 (a) The P - values of two-way interactions Timex Temperature, Timex Solvent, 
Temperature x Solvent, and the P - value of the three-way interaction 
Timex Temperature x Solvent are 0.1103, 0.1723, 0.8558, and 0.0140, respectively. 

(b) The interaction plots for different levels of Solvent are given here. 

Solvent=Ethanol Solvent=Toluene 



Time Time 

(c) A normal probability plot of the residuals is given and it appears that normality 
assumption may not be valid. 

Normal Q-Q Plot 



- 2-10 1 2 


Theoretical Quantiles 








228 


Chapter 14 Factorial Experiments (Two or More Factors) 


14.24 (a) The two-way interaction plots are given here and they all show significant inter- 

actions. 



Hardness 


(b) The normal probability plot of the residuals is shown here and it appears that 
normality is somewhat violated at the tails of the distribution. 

Normal Q-Q Plot 



- 2-10 1 2 
Theoretical Quantiles 


14.25 The ANOVA table is given. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Filters 

4.63389 

2 

2.31694 

8.39 

0.0183 

Operators 

10.31778 

3 

3.43926 

12.45 

0.0055 

Interaction 

1.65722 

6 

0.27620 

1.49 

0.2229 

Error 

4.44000 

24 

0.18500 



Total 

21.04889 

35 
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Note the / values for the main effects are using the interaction term as the denominator. 

(a) The hypotheses are 

Ho ■ vie = 0 , 

Hi : cr 2 Q/3 7 ^ 0 . 

Decision: Since P-value = 0.2229, the null hypothesis cannot be rejected. There 
is no significant interaction variance component. 

(b) The hypotheses are 

H' 0 : *1 = 0. = 0. 

H'c<4 0 . H'l : 0 . 

Decisions: Based on the P-values of 0.0183, and 0.0055 for H' 0 and H[, respec- 
tively, we reject both H 0 and H 0 . Both tr 2 and cr| are significantly different from 
zero. 

(c) d 2 = s 2 = 0.185; d 2 = 2 - 3169 1 1 ~ a185 = 0.17766, and a } = 3 - 43926 9 ~ 0 185 = 0.35158. 

14.26 d 2 = ( 42 -6289/2- 14.8011/4 = L4 g 7 g Brand 
d 2 = ( 299 ' 342 2/ 2 ~ 14-801 1/4 = 12 1642 Time . 

s 2 = 0.9237. 

14.27 The AN OVA table with expected mean squares is given here. 


Source of Degrees of Mean (a) (b) 

Variation Freedom Square Computed / Computed / 


A 

3 

140 

fi = 

s\/sl = 

= 5.83 

fi = 

4/4 

= 5.83 

B 

1 

480 

h = 

4/4i 

= 78.82 

h = 

4/4 

= 26.67 

C 

2 

325 

h 

si/ s\ - 

= 13.54 

fo = 

4/4 

= 13.54 

AB 

3 

15 

/: 

4/4? 

= 2.86 

u = 

4/4 

= 7.50 

AC 

6 

24 

fr> = 

s V s2 P 2 

= 4.57 

/■5 = 

4/4 

= 12.00 

BC 

2 

18 

u = 

sl/s 2 pi 

= 4.09 

fe = 

4/4 

= 9.00 

ABC 

6 

2 

fl = 

sVs 2 = 

= 0.40 

h — 

4/s 2 

= 0.40 

Error 

24 

5 








Total 47 


In column (a) we have found the following main effects and interaction effects significant 

using the pooled estimates: <Xg, u 2 , and (X 2 r 

s 2 pi = (12 + 120)/30 = 4.4 with 30 degrees of freedom. 

s 2 2 = (12 + 120 + 36)/32 = 5.25 with 32 degrees of freedom. 

s 2 3 = (12 + 120 + 36 + 45)/35 = 6.09 with 35 degrees of freedom. 

In column (b) we have found the following main effect and interaction effect significant 
when sums of squares of insignificant effects were not pooled: tr 2 and <r 2 r 
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4 j 

14.28 Y) qf = 0.24 and (j) = J [ 3 ^°^] = 2.55. With a = 0.05, v\ = 2 and V 2 = 39 we find 
1=1 * 

from A. 16 that the power is approximately 0.975. Therefore, 2 observations for each 
treatment combination are sufficient. 

14.29 The power can be calculated as 


l-^P F(2.6)>A 05 (2.6) g2 ;;; 3 ^ 1 


(5.14)(0.2762) 

2.3169 


= P[F( 2,6) > 0.6127] = 0.57. 


14.30 (a) A mixed model. Inspectors (cq in the model) are random effects. Inspection level 

(/ 3j in the model) is a fixed effect. 

Vijk = p + Oti + (3j + ( Ol(3)ij + 6ijk', 

Oii ~ n(x ; 0, a 2 ), (a/3)p- ~ n(x; 0, a 2 afi ) , ~ n(x] 0, a 2 ), ^ dj = 0. 

j 

(b) The hypotheses are 

Ho : a 2 = 0. 77o : = 0- Ho : /?i = = /?3 = 0, 

Hi '■ &a 0- H\ : cr 2 ^ 7 ^ 0 : At least one /^’s is not 0 . 

f- 2, 36 = 0.02 with P-value = 0.9806. There does not appear to be an effect due to 
the inspector. 

/ 4i 36 = 0.04 with P-value = 0.9973. There does not appear to be an effect due to 
the inspector by inspector level. 

f‘ 2,4 = 54.77 with P-value = 0.0012. Mean inspection levels were significantly 
different in determining failures per 1000 pieces. 

14.31 (a) A mixed model. 

(b) The ANOVA table is 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Material 

1.03488 

2 

0.51744 

47.42 

< 0.0001 

Brand 

0.60654 

2 

0.30327 

1.73 

0.2875 

Material* Brand 

9,70109 

4 

0.17527 

16.06 

0.0004 

Error 

0.09820 

9 

0.01091 



Total 

2.44071 

17 





(c) No, the main effect of Brand is not significant. An interaction plot is given. 
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CD 
LO 

LO 

CJ 
LO 

O 
LO 

co 
CD 

ABC 
Brand 

Although brand A has highest means in general, it is not always significant, 
especially for Material 2. 

14.32 (a) Operators (cq) and time of day {fl 3 ) are random effects. 

Uijk — n + Oii + (3j + + eijk] 

~ n(x] 0, a 2 ), fa ~ n(x ; 0, a 2 0 ), (ap) i;j ~ n(x ; 0, u 2 ^), e ijfc ~ n(x ; 0, a 2 ). 

(b) cr 2 = <r| = 0 (both estimates of the variance components were negative). 

(c) The yield does not appear to depend on operator or time. 

14.33 (a) A mixed model. Power setting (cq in the model) is a fixed effect. Cereal type (/3.,- 

in the model) is a random effect. 

Uijk = H + Cq + Pj + (aP)ij + 6ijk', 

^cq = 0, Pj ~ n(x-,0, cr 2 p), (aP)ij ~ n(x; 0, a 2 ^), ~ n(z; 0, cr 2 ). 

i 

(b) No. / 2,4 = 1.37 and P-value = 0.3524. 

(c) No. The estimate of a 2 is negative. 

14.34 (a) The ANOVA table is given. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Sweetener 

0.00871 

3 

0.00290 

2.90 

0.0670 

Flour 

0.00184 

1 

0.00184 

1.84 

0.1941 

Interaction 

0.01015 

3 

0.00338 

3.38 

0.0442 

Error 

0.01600 

16 

0.00100 



Total 

0.03670 

23 





Sweetener factor is close to be significant, while the P-value of the Flour shows 
insignificance. However, the interaction effects appear to be significant. 
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(b) Since the interaction is significant with a P-value = 0.0442, testing the effect 
of sweetener on the specific gravity of the cake samples by flour type we get 
P-value = 0.0077 for “All Purpose” flour and P-value = 0.6059 for “Cake” flour. 
We also have the interaction plot which shows that sweetener at 100% concen- 
tration yielded a specific gravity significantly lower than the other concentrations 
for all-purpose flour. For cake flour, however, there were no big differences in the 
effect of sweetener concentration. 



0 50 75 100 


Sweetener 


14.35 (a) The ANOVA table is given. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Sauce 

1,031.3603 

1 

1,031.3603 

7.32 

0.0123 

Fish 

16,505.8640 

2 

8,252.9320 

58.58 

< 0.0001 

Sauce* Fish 

724.6107 

2 

362.3053 

2.57 

0.0973 

Error 

3,381.1480 

24 

140.8812 



Total 

21,642.9830 

29 





Interaction effect is not significant. 

(b) Both P- values of Sauce and Fish Type are all small enough to call significance. 
14.36 (a) The ANOVA table is given here. 


Source of 

Sum of 

Degrees of 

Mean 

Computed 


Variation 

Squares 

Freedom 

Square 

/ 

P-value 

Plastic Type 

142.6533 

2 

71.3267 

16.92 

0.0003 

Humidity 

143.7413 

3 

47.9138 

11.36 

0.0008 

Interaction 

133.9400 

6 

22.3233 

5.29 

0.0070 

Error 

50.5950 

12 

4.2163 



Total 

470.9296 

23 




The interaction is 

significant. 






(b) The SS for AB with only Plastic Type A and B is 24.8900 with 3 degrees of 
freedom. Hence / = 2 3 = 1-97 with P-value = 0.1727. Hence, there is no 

significant interaction when only A and B are considered. 
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(c) The SS for the single-degree-of-freedom contrast is 143.0868. Hence / = 33.94 
with P-value < 0.0001. Therefore, the contrast is significant. 

(d) The SS for Humidity when only C is considered in Plastic Type is 2.10042. So, 
/ = 0.50 with P-value = 0.4938. Hence, Humidity effect is insignificant when 
Type C is used. 

14.37 (a) The ANOVA table is given here. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Environment 

0.8624 

1 

0.8624 

15.25 

0.0021 

Stress 

40.8140 

2 

20.4020 

360.94 

< 0.0001 

Interaction 

0.0326 

2 

0.0163 

0.29 

0.7547 

Error 

0.6785 

12 

0.0565 



Total 

42.3875 

17 





The interaction is insignificant. 

(b) The mean fatigue life for the two main effects are all significant. 

14.38 The ANOVA table is given here. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Sweetener 

1.26893 

3 

0.42298 

2.65 

0.0843 

Flour 

1.77127 

1 

1.77127 

11.09 

0.0042 

Interaction 

0.14647 

3 

0.04882 

0.31 

0.8209 

Error 

2.55547 

16 

0.15972 



Total 

5.74213 

23 





The interaction effect is insignificant. The main effect of Sweetener is somewhat in- 
significant, since the P-value = 0.0843. The main effect of Flour is strongly significant. 

14.39 The ANOVA table is given here. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 


1133.5926 

2 

566.7963 

8.30 

0.0016 

A 

26896.2593 

2 

13448.1296 

196.91 

< 0.0001 

B 

40.1482 

2 

20.0741 

0.29 

0.7477 

C 

216.5185 

4 

54.1296 

0.79 

0.5403 

AB 

1.6296 

4 

0.4074 

0.01 

0.9999 

AC 

2.2963 

4 

0.5741 

0.01 

0.9999 

Error 

2.5926 

8 

0.3241 

0.00 

1.0000 


1844.0000 

27 

68.2963 



Total 

30137.0370 

53 
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All the two-way and three-way interactions are insignificant. In the main effects, only 
A and B are significant. 

14.40 (a) Treating Solvent as a class variable and Temperature and Time as continuous 

variable, only three terms in the ANOVA model show significance. They are (1) 
Intercept; (2) Coefficient for Temperature and (3) Coefficient for Time. 

(b) Due to the factor that none of the interactions are significant, we can claim that 
the models for ethanol and toluene are equivalent apart from the intercept. 

(c) The three-way interaction in Exercise 14.23 was significant. However, the general 
patterns of the gel generated are pretty similar for the two Solvent levels. 

14.41 The ANOVA table is displayed. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Surface 

2.22111 

2 

1.11056 

0.02 

0.9825 

Pressure 

39.10778 

2 

19.55389 

0.31 

0.7402 

Interaction 

112.62222 

4 

28.15556 

0.45 

0.7718 

Error 

565.72000 

9 

62.85778 



Total 

719.67111 

17 





All effects are insignificant. 

14.42 (a) This is a two-factor fixed-effects model with interaction. 

Uijk = p + oii + (3j+)a(3)ij + €ijk, 

^ = 0, y, 0-j = 0, y(a;/3)jj = = 0, e ijk ~ n(x\ 0, a) 

i j i j 

(b) The ANOVA table is displayed. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Time 

0.16668 

3 

0.05556 

48.67 

< 0.0001 

Temperature 

0.27151 

2 

0.13575 

118.91 

< 0.0001 

Interaction 

0.03209 

6 

0.00535 

4.68 

0.0111 

Error 

0.01370 

12 

0.00114 



Total 

0.48398 

23 





The interaction is insignificant, while two main effects are significant. 

(c) It appears that using a temperature of — 20°C with drying time of 2 hours would 
speed up the process and still yield a flavorful coffee. It might be useful to try 
some additional runs at this combination. 
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14.43 ( 


( 


a) Since it is more reasonable to assume the data come from Poisson distribution, 
it would be dangerous to use standard analysis of variance because the normality 
assumption would be violated. It would be better to transform the data to get at 
least stable variance. 


The ANOVA table is displayed. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Teller 

40.45833 

3 

13.48611 

6.35 

0.0080 

Time 

97.33333 

2 

48.66667 

22.90 

< 0.0001 

Interaction 

8.66667 

6 

1.44444 

0.68 

0.6694 

Error 

25.50000 

12 

2.12500 



Total 

171.95833 

23 





The interaction effect is insignificant. Two main effects, Teller and Time, are all 
significant. 

c) The ANOVA table using a squared- root transformation on the response is given. 


Source of 
Variation 

Sum of 
Squares 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Teller 

0.05254 

3 

0.17514 

5.86 

0.0106 

Time 

1.32190 

2 

0.66095 

22.11 

< 0.0001 

Interaction 

0.11502 

6 

0.01917 

0.64 

0.6965 

Error 

0.35876 

12 

0.02990 



Total 

2.32110 

23 





Same conclusions as in (b) can be reached. To check on whether the assumption 
of the standard analysis of variance is violated, residual analysis may used to do 
diagnostics. 




Chapter 15 

2^ Factorial Experiments and 
Fractions 


15.1 Either using Table 15.5 (e.g., SSA = C 41 +5 1 -57-63+67+54-76+73d = 2 .6667) or running 
an analysis of variance, we can get the Sums of Squares for all the factorial effects. 

SSA = 2.6667, SSB = 170.6667, SSC = 104.1667, SS(AB) = 1.500. 
SS(AC) = 42.6667, SS(BC) = 0.0000, SS(ABC) = 1.5000. 

15.2 A simplified ANOVA table is given. 


Source of 
Variation 

Degrees of 
Freedom 

Computed 

/ 

P-value 

A 

1 

1294.65 

< 0.0001 

B 

1 

43.56 

0.0002 

AB 

1 

20.88 

0.0018 

C 

1 

116.49 

< 0.0001 

AC 

1 

16.21 

0.0038 

BC 

1 

0.00 

0.9668 

ABC 

1 

289.23 

< 0.0001 

Error 

8 



Total 

15 


All the main and interaction effects are significant, other than BC effect. However, 
due to the significance of the 3-way interaction, the insignificance of BC effect cannot 
be counted. Interaction plots are given. 
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15.3 The AD and BC interaction plots are printed here. The AD plot varies with levels of 
C since the AC D interaction is significant, or with levels of B since ABD interaction 
is significant. 

AD Interaction BC Interaction 



D 


C 


15.4 The ANOVA table is displayed. 


Source of 
Variation 

Degrees of 
Freedom 

Computed 

/ 

P-value 

A 

1 

57.85 

< 0.0001 

B 

1 

7.52 

0.0145 

AB 

1 

6.94 

0.0180 

C 

1 

127.86 

< 0.0001 

AC 

1 

7.08 

0.0171 

BC 

1 

10.96 

0.0044 

ABC 

1 

1.26 

0.2787 

D 

1 

44.72 

< 0.0001 

AD 

1 

4.85 

0.0427 

BD 

1 

4.85 

0.0427 

ABD 

1 

1.14 

0.3017 

CD 

1 

6.52 

0.0213 

ACD 

1 

1.72 

0.2085 

BCD 

1 

1.20 

0.2900 

ABCD 

1 

0.87 

0.3651 

Error 

16 



Total 

31 







Solutions for Exercises in Chapter 15 


239 


All main effects and two-way interactions are significant, while all higher order inter- 
actions are insignificant. 

15.5 The ANOVA table is displayed. 


Source of 
Variation 

Degrees of 
Freedom 

Computed 

/ 

P-value 

A 

1 

9.98 

0.0251 

B 

1 

0.20 

0.6707 

C 

1 

6.54 

0.0508 

D 

1 

0.02 

0.8863 

AB 

1 

1.83 

0.2338 

AC 

1 

0.20 

0.6707 

AD 

1 

0.57 

0.4859 

BC 

1 

19.03 

0.0073 

BD 

1 

1.83 

0.2338 

CD 

1 

0.02 

0.8863 

Error 

5 



Total 

15 


One two-factor interaction BC, which is the interaction of Blade Speed and Condition 
of Nitrogen, is significant. As of the main effects, Mixing time (A) and Nitrogen 
Condition (C) are significant. Since BC is significant, the insignificant main effect B , 
the Blade Speed, cannot be declared insignificant. Interaction plots for BC at different 
levels of A are given here. 

Speed and Nitrogen Interaction (Time=1) Speed and Nitrogen Interaction (Time=2) 



1 2 
Nitrogen Condition 



2 


Nitrogen Condition 


15.6 (a) The three effects are given as 

301 + 304- 269 -292 _ 301 + 269 - 304- 292 

w A = = 11, w B = = -6.5, 

301 - 304 - 269 + 292 
w A b = = 5. 

There are no clear interpretation at this time. 

(b) The ANOVA table is displayed. 
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Source of 
Variation 

Degrees of 
Freedom 

Computed 

/ 

P-value 

Concentration 

1 

35.85 

0.0039 

Feed Rate 

1 

12.52 

0.0241 

Interaction 

1 

7.41 

0.0529 

Error 

4 



Total 

7 


The interaction between the Feed Rate and Concentration is closed to be signifi- 
cant at 0.0529 level. An interaction plot is given here. 



Feed Rate 

The mean viscosity does not change much at high level of concentration, while it 
changes a lot at low concentration. 

(c) Both main effects are significant. Averaged across Feed Rate a high concentration 
of reagent yields significantly higher viscosity, and averaged across concentration 
a low level of Feed Rate yields a higher level of viscosity. 

15.7 Both AD and BC interaction plots are shown in Exercise 15.3. Here is the interaction 
plot of AB. 


AB Interaction 



For AD , at the high level of A, Factor D essentially has no effect, but at the low level 
of A, D has a strong positive effect. For BC, at the low level of B , Factor C has a 
strong negative effect, but at the high level of B, the negative effect of C is not as 
pronounced. For AB, at the high level of B , A clearly has no effect. At the low level 
of B, A has a strong negative effect. 


15.8 The two interaction plots are displayed. 
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AD Interaction (B=— 1) AD Interaction (B=1) 



-1 1-1 1 
A A 

It can be argued that when B — 1 that there is essentially no interaction between A 
and D. Clearly when B = — 1, the presence of a high level of D produces a strong 
negative effect of Factor A on the response. 

15.9 (a) The parameter estimates for xi, x 2 and XiX 2 are given as follows. 


Variable 

Degrees of Freedom 

Estimate 

/ 

P-value 

X\ 

1 

5.50 

5.99 

0.0039 

x 2 

1 

-3.25 

-3.54 

0.0241 

XiX 2 

1 

2.50 

2.72 

0.0529 


(b) The coefficients of bi, b 2 , and b± 2 are wa/ 2, wb/ 2, and wab/ 2, respectively. 

(c) The P-values are matched exactly. 

15.10 The effects are given here. 


A 

B 

C 

D 

AB 

AC 

AD 

BC 

-0.2625 

-0.0375 

0.2125 

0.0125 

-0.1125 

0.0375 

-0.0625 

0.3625 

BD 

CD 

ABC 

ABD 

ACD 

BCD 

ABCD 


0.1125 

0.0125 

-0.1125 

0.0375 

-0.0625 

0.1125 

-0.0625 



The normal probability plot of the effects is displayed. 

Normal Q-Q Plot 


Theoretical Quantiles 


(a) It appears that all three- and four-factor interactions are not significant. 

(b) From the plot, it appears that A and BC are significant and C is somewhat 
significant. 
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15.11 (a) The effects are given here and it appears that B, C, and AC are all important. 

A B CAB AC BC ABC 

-0.875 5.875 9.625 -3.375 -9.625 0.125 -1.125 

(b) The ANOVA table is given. 


(c) Yes, they do agree. 

(d) For the low level of Cutting Angle, C, Cutting Speed, A, has a positive effect on 
the life of a machine tool. When the Cutting Angle is large, Cutting Speed has a 
negative effect. 

15.12 A is not orthogonal to BC, B is not orthogonal to AC, and C is not orthogonal to AB. 
If we assume that interactions are negligible, we may use this experiment to estimate 
the main effects. Using the data, the effects can be obtained as A: 1.5; B: —6.5; 
C: 2.5. Hence Factor B, Tool Geometry, seems more significant than the other two 
factors. 

15.13 Here is the block arrangement. 


1 

Block 

2 

Block 

1 

2 

Block 

1 

2 

(1) 


a 


(1) 

a 


(1) 

a 

c 


b 


c 

b 


c 

b 

ab 


ac 


ab 

ac 


ab 

ac 

abc 


be 


abc 

be 


abc 

be 

F 

feplicate 1 

Replicate 2 

Replicate 3 

AB Confounded 

AB Confounded 

AB Confounded 



AC Interaction 



-1 1 
A 
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Analysis of Variance 


Source of Variation 

Degrees of Freedom 

Blocks 

5 

A 

1 

B 

1 

C 

1 

AC 

1 

BC 

1 

ABC 

1 

Error 

12 

Total 

23 


15.14 (a) ABC is confounded with blocks in the first replication and ABCD is confounded 
with blocks in second replication. 

(b) Computing the sums of squares by the contrast method yields the following 
ANOVA table. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Blocks 

3 

2.32 



A 

1 

2.00 

3.34 

0.0907 

B 

1 

0.50 

0.83 

0.3775 

C 

1 

4.50 

7.51 

0.0168 

D 

1 

8.00 

13.36 

0.0029 

AB 

1 

0.50 

0.83 

0.3775 

AC 

1 

0.32 

0.53 

0.4778 

BC 

1 

0.50 

0.83 

0.3775 

AD 

1 

0.72 

1.20 

0.2928 

BD 

1 

0.32 

0.53 

0.4778 

CD 

1 

0.18 

0.30 

0.5928 

ABC 

1 

1.16 

1.93 

0.1882 

ABD 

1 

0.32 

0.53 

0.4778 

ACD 

1 

0.02 

0.03 

0.8578 

BCD 

1 

0.18 

0.30 

0.5928 

ABCD 

1 

0.53 

0.88 

0.3659 

Error 

13 

0.60 



Total 

31 


Only the main effects C and D are significant. 

15.15 L i = 71 + 72 + 73 and L 2 = 7i + 72 + 74- For treatment combination (1) we find 
L\ (mod 2) = 0. For treatment combination a we find L\ (mod 2) = 1 and L 2 (mod 2) = 
1. After evaluating L\ and L 2 for all 16 treatment combinations we obtain the following 
blocking scheme: 
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Block 1 Block 2 Block 3 Block 4 



Since ( ABC)(ABD ) = A 2 B 2 CD = CD (mod 2), then CD is the other effect con- 
founded. 

15.16 (a) L i = 71 + 72 + 74 + 75, L 2 = 71 + 75. We find that the following treatment combi- 
nations are in the principal block (Li = 0, L 2 = 0): (1), c, ae, bd, ace, abde, abcde. 
The other blocks are constructed by multiplying the treatment combinations in 
the principal block modulo 2 by a, b, and ab, respectively, to give the following 
blocking arrangement: 


Block 1 Block 2 Block 3 Block 4 



(b) ( ABDE)(AE ) = BD (mod 2). Therefore BD is also confounded with days. 

(c) Yates’ technique gives the following sums of squares for the main effects: 

SSA = 21.9453, SSB = 40.2753, SSC = 2.4753, 

SSD = 7.7028, SSE = 1.0878. 

15.17 Li = 71 + 72 + 73, L 2 = 71 + 72. 


Block 

1 2 


Bloc! 

1 

2 

Block 

1 2 

abc 

ab 


abc 

ab 


(1) 

a 

a 

ac 


a 

ac 


c 

b 

b 

be 


b 

be 


ab 

ac 

c 

(1) 


c 

(1) 


abc 

be 

Rep 1 


Rep S 


Rep 3 

ABC Confounded 

ABC Confounded 

AB Confounded 
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For treatment combination (1) we find L\ (mod 2) = 0 and L 2 (mod 2) = 0. For 
treatment combination a we find Li (mod 2) = 1 and L 2 (mod 2) = 1. Replicate 1 and 
Replicate 2 have L\ = 0 in one block and Li = 1 in the other. Replicate 3 has L 2 = 0 
in one block and L 2 = 1 in the other. 


Analysis of Variance 

Source of Variation Degrees of Freedom 


Blocks 

5 

A 

1 

B 

1 

C 

1 

AB 

1' 

AC 

1 

BC 

1 

ABC 

1' 

Error 

11 

Total 

23 


Relative information on ABC = | and relative information on AB = |. 
15.18 (a) The ANOVA table is shown here. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Operators 

1 

0.1225 



A 

1 

4.4100 

0.04 

0.2413 

B 

1 

3.6100 

1.55 

0.2861 

C 

1 

9.9225 

1.27 

0.0912 

D 

1 

2.2500 

3.49 

0.3945 

Error 

10 

2.8423 

0.79 


Total 

15 


None of the main effects is significant at 0.05 level. 

(b) ABC is confounded with operators since all treatments with positive signs in the 
ABC contrast are in one block and those with negative signs are in the other 
block. 

15.19 (a) One possible design would be: 


Machine 


1 

(1) 

ab 

ce 

abce 

acd 

bde 

ade 

bed 

2 

a 

b 

ace 

bee 

cd 

abde 

de 

abed 

3 

c 

abc 

e 

abe 

ad 

bede 

aede 

bd 

4 

d 

abd 

cde 

abode 

ac 

be 

ae 

be 


(b) ABD, CDE , and ABCE 
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15.20 (a) y = 43.9 + 1.625xi — 8.625x2 + 0.375x3 + 9.125 xiX2 + 0.625xiX 3 + 0.875x2X3. 

(b) The Lack-of-fit test results in a P-value of 0.0493. There are possible quadratic 
terms missing in the model. 

15.21 (a) The P-values of the regression coefficients are: 


Parameter 

Intercept 

Xl 


^3 

xix 2 

xix 3 

x 2 x 3 

X1X2X3 

P-value 

< 0.0001 

0.5054 

0.0772 

0.0570 

0.0125 

0.0205 

0.7984 

0.6161 


and s 2 = 0.57487 with 4 degrees of freedom. So x 2 , x 3 , X]X 2 and XiX 3 are impor- 
tant in the model. 

(b) t = , ^ = 52 07 5 ^ 4 ir 275 _ _ g Q 3 Qg Hence the P-value = 0.0038 for 

V ’ y/s*(l/n f +l/n c ) \J ( 0 . 57487 )(l/ 8 +l/ 4 ) 

testing quadratic curvature. It is significant. 

(c) Need one additional design point different from the original ones. 

15.22 (a) No. 

(b) It could be as follows. 


Machine 


1 

(1) 

ad 

be 

abce 

acd 

abd 

ede 

bde 

2 

a 

e 

abc 

bee 

cd 

bd 

aede 

abde 

3 

b 

abe 

c 

ace 

abed 

ad 

bede 

de 

4 

d 

ade 

bed 

abede 

ac 

ab 

ce 

be 


ADE, BCD and ABCE are confounded with blocks. 

(c) Partial confounding. 

15.23 To estimate the quadratic terms, it might be good to add points in the middle of the 
edges. Hence (—1,0), (0,-1), (1,0), and (0,1) might be added. 

15.24 The alias for each effect is obtained by multiplying each effect by the defining contrast 
and reducing the exponents modulo 2. 

A =CDE , AB =BCDE, BD=ABCE, B =ABCDE , AC =DE, 

BE=ABCD, C =ADE, AD=CE, ABC=BDE , D =ACE , 

AE=CD , ABD=BCE, E =ACD , BC =ABDE , ABE=BCD , 


15.25 (a) With BCD as the defining contrast, we have L = 72 + 73 + 74- The | fraction cor- 
responding to L = 0 (mod 2 is the principal block: {(1), a, be, abc, bd, abd, cd , acd}. 

(b) To obtain 2 blocks for the \ fraction the interaction ABC is confounded using 
L = 7i + 72 + 73 : 
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Block 1 Block 2 


(1) 


a 

be 


abe 

abd 


bd 

acd 


cd 


(c) Using BCD as the defining contrast we have the following aliases: 

A=ABCD , AB=ACD , B=CD , AC=ABD , 
C=BD, AD=ABC, D=BC. 


Since AD and ABC are confounded with blocks there are only 2 degrees of freedom 
for error from the unconfounded interactions. 


Analysis of Variance 


Source of Variation 

Degrees of Freedom 

Blocks 

1 

A 

1 

B 

1 

C 

1 

D 

1 

Error 

2 


Total 7 


15.26 With ABCD and BDEF as defining contrasts, we have 

L \ = 7i + 72 + 73 + 74, L -2 = 72 + 74 + 75 + 7e- 

The following treatment combinations give Li — 0, L 2 = 0 (mod 2) and thereby suffice 
as the j fraction: 

{(1), ac, bd, abed, abe, bee, ade, abf, bef, adf , cdf, ef, acef, bdef, abedef}. 

The third defining contrast is given by 

{ABCD) {BDEF) = AB 2 CD 2 EF = ACEF (mod 2). 

The effects that are aliased with the six main effects are: 


A=BCD =ABDEF=CEF, 
C=ABD =BCDEF=AEF, 
E=ABCDE=BDF =ACF, 


B=ACD =DEF=ABCEF, 
D=ABC =BEF=ACDEF, 
F=ABCDF=BDE=ACE. 
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15.27 (a) With ABCE and ABDF, and hence ( ABCE) (ABDF ) = CDEF as the defining 
contrasts, we have 

Li = 7i + 72 + 73 + 75, L 2 = 7i + 72 + 74 + 7e- 

The principal block, for which Li = 0, and L 2 = 0, is as follows: 

{(1), ab, acd, bed, , ce, abce, ade, bde , acf , bef, df, abdf , aef, bef , edef, abedef}. 


(b) The aliases for each effect are obtained by multiplying each effect by the three 
defining contrasts and reducing the exponents modulo 2. 


71 =ACDEF, 

C =ABE =ABCDF=DEF, 

E =ABC =ABDEF=CDF, 

AB =CE =DF =ABCDEF, 
AD =BCDE=BF =ACEF , 

AF =BCEF=BD =ACDE, 
DE =ABCD=ABEF =CF, 
DCF=AEF =ACD =BDE, 


B =ACE =ADF =BCDEF, 
D =ABCDE=ABF =CEF , 

F =ABCEF=ABD =CDE, 

AC =BE =BCDF=ADEF, 
AE =BC =BDEF=ACDF , 
CD =ABDE =ABCF=EF , 
BCD=ADE =ACF =BEF, 


Since E and F do not interact and all three-factor and higher interactions are 
negligible, we obtain the following ANOVA table: 


Source of Variation Degrees of Freedom 


A 

1 

B 

1 

C 

1 

D 

1 

E 

1 

F 

1 

AB 

1 

AC 

1 

AD 

1 

BC 

1 

BD 

1 

CD 

1 

Error 

3 

Total 

15 


15.28 The ANOVA table is shown here and the error term is computed by pooling all the 
interaction effects. Factor E is the only significant effect, at level 0.05, although the 
decision on factor G is marginal. 
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Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

1 

1.44 

0.48 

0.5060 

B 

1 

4.00 

1.35 

0.2793 

C 

1 

9.00 

3.03 

0.1199 

D 

1 

5.76 

1.94 

0.2012 

E 

1 

16.00 

5.39 

0.0488 

F 

1 

3.24 

1.09 

0.3268 

G 

1 

12.96 

4.36 

0.0701 

Error 

8 

2.97 



Total 

15 


15.29 All two-factor interactions are aliased with each other. So, assuming that two-factor 
as well as higher order interactions are negligible, a test on the main effects is given in 
the AN OVA table. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

1 

6.125 

5.81 

0.0949 

B 

1 

0.605 

0.57 

0.5036 

C 

1 

4.805 

4.56 

0.1223 

D 

1 

0.245 

0.23 

0.6626 

Error 

3 

1.053 



Total 

7 


Apparently no main effects is significant at level 0.05. Comparatively factors A and C 
are more significant than the other two. Note that the degrees of freedom on the error 
term is only 3, the test is not very powerful. 


15.30 Two-factor interactions are aliased with each other. There are total 7 two-factor in- 
teractions that can be estimated. Among those 7, we picked the three, which are AC, 
AF , and BD, that have largest SS values and pool the other 2-way interactions to the 
error term. An ANOVA can be obtained. 
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Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

1 

81.54 

0.37 

0.5643 

B 

1 

166.54 

0.76 

0.4169 

C 

1 

5.64 

0.03 

0.8778 

D 

1 

4.41 

0.02 

0.8918 

E 

1 

40.20 

0.18 

0.6834 

F 

1 

1678.54 

7.66 

0.0325 

AC 

1 

978.75 

4.47 

0.0790 

AF 

1 

625.00 

2.85 

0.1423 

BD 

1 

429.53 

1.96 

0.2111 

Error 

6 

219.18 



Total 

15 


Main effect F, the location of detection, appears to be the only significant effect. The 
AC interaction, which is aliased with BE , has a P-value closed to 0.05. 

15.31 To get all main effects and two-way interactions in the model, this is a saturated design, 
with no degrees of freedom left for error. Hence, we first get all SS of these effects and 
pick the 2-way interactions with large SS values, which are AD, AE, BD and BE. 
An ANOVA table is obtained. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

A 

1 

388,129.00 

3,585.49 

< 0.0001 

B 

1 

277,202.25 

2,560.76 

< 0.0001 

C 

1 

4,692.25 

43.35 

0.0006 

D 

1 

9,702.25 

89.63 

< 0.0001 

E 

1 

1,806.25 

16.69 

0.0065 

AD 

1 

1,406.25 

12.99 

0.0113 

AE 

1 

462.25 

4.27 

0.0843 

BD 

1 

1,156.25 

10.68 

0.0171 

BE 

1 

961.00 

8.88 

0.0247 

Error 

6 

108.25 



Total 

15 


All main effects, plus AD, BD and BE two-way interactions, are significant at 0.05 
level. 

15.32 Consider a 2 4 design with letters A, B, C , and D, with design points 

{(1), a, b, c, d, ab, ac, ad, be, bd, cd, abc, abd, acd, bed, abed} 

. Using E = ABCD, we have the following design: 


(e, a, b, c, d, abe, ace, ade, bee, bde, ede, abc, abd, acd, bed, abede}. 
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15.33 Begin with a 2 3 with design points 

{(1), a, b, c, ab, ac, be , abc}. 

Now, use the generator D = AB, E = AC, and F = BC. We have the following 
result: 

{def, af, be, cd, abd, ace, bef, abode f}. 

15.34 We can use the D = AB, E = —AC and F = BC as generators and obtain the result: 

{df, aef, b, ede, abde, ac, beef, abedf}. 


15.35 Here are all the aliases 


A=BD^CE=CDF=BEF= 
B=AD=CF=CDE=AEF= 
C=AE =BF=BDE=ADF= 
D=AB =EF=BCE=ACF = 
E=AC=DF=ABF=BCD= 
F=BC=DE=ACD=ABE= 


=ABCF =ADEF =ABCDE- 
= ABCE =BDEF =ABCDF ; 
=CDEF =ABCD =ABCEF ; 
=BCDF =ACDE =ABDEF\ 
=ABDE =BCEF =ACDEF ; 
= ACEF =ABDF =BCDEF. 


15.36 (a) The defining relation is ABC = — 

(b) A = -BC, B = -AC, and C = -AB. 

(c) The mean squares for A, B, and C are 1.50, 0.34, and 5.07, respectively. So, 
factor C , the amount of grain refiner, appears to be most important. 


(d) Low level of C. 

(e) All at the “low” level. 

(f) A hazard here is that the interactions may play significant roles. The following 
are two interaction plots. 

AB Interaction BC Interaction 



-1 


-1 
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15.37 When the variables are centered and scaled, the fitted model is 

y = 12.7519 + 4.7194x1 + 0.8656x 2 - 1.4156x 3 . 

The lack-of-fit test results in an /-value of 81.58 with P-value < 0.0001. Hence, higher- 
order terms are needed in the model. 

15.38 The ANOVA table for the regression model looks like the following. 


Coefficients 

Intercept 

Pi 

P2 

Ps 

Pa 

Pb 

Two-factor interactions 
Lack of fit 
Pure error 
Total 


Degrees of Freedom 

T 

1 

1 

1 

1 

1 

10 

16 

32 

63 


15.39 The defining contrasts are 

AFG, CEFG, ACDF, BEG , BDFG, CDG, BCDE , ABCDEFG , DEF, ADEG. 


15.40 Begin with the basic line for N = 24; permute as described in Section 15.12 until 18 
columns are formed. 

15.41 The fitted model is 

y =190, 056.67 + 181, 343.33xi + 40, 395.00x 2 + 16, 133.67x s + 45, 593.67x 4 
- 29, 412.33x5 + 8, 405.00 x 6 . 

The f-tests are given as 


Variable 

t 

P-value 

Intercept 

4.48 

0.0065 

Xi 

4.27 

0.0079 

X 2 

0.95 

0.3852 

X3 

0.38 

0.7196 

X 4 

1.07 

0.3321 

x 5 

-0.69 

0.5194 

x 6 

0.20 

0.8509 


Only Xi and x 2 are significant. 
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15.42 An AN OVA table is obtained. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Polymer 1 

1 

172.98 

1048.36 

< 0.0001 

Polymer 2 

1 

180.50 

1093.94 

< 0.0001 

Polymer 1* Polymer 2 

1 

1.62 

9.82 

0.0351 

Error 

4 

0.17 



Total 

7 


All main effects and interactions are significant. 

15.43 An ANOVA table is obtained. 


Source of 
Variation 

Degrees of 
Freedom 

Mean 

Square 

Computed 

/ 

P-value 

Mode 

1 

2,054.36 

74.25 

< 0.0001 

Type 

1 

4,805.96 

173.71 

< 0.0001 

Mode*Type 

1 

482.90 

17.45 

0.0013 

Error 

12 

27.67 



Total 

15 


All main effects and interactions are significant. 

15.44 Two factors at two levels each can be used with three replications of the experiment, 
giving 12 observations. The requirement that there must be tests on main effects 
and the interactions suggests that partial confounding be used The following design is 
indicated: 


Block 

1 

2 

1 

Block 

2 

1 

Block 

2 

(1) 

ab 


a 

b 


a 

ab 


(1) 

b 


(1) 

a 


ab 

a 

Rep 1 



Rep 2 


Rep 3 


15.45 Using the contrast method and compute sums of squares, we have 


Source of Variation 

d.f. 

MS 

/ 

A 

1 

0.0248 

1.24 

B 

1 

0.0322 

1.61 

C 

1 

0.0234 

1.17 

D 

1 

0.0676 

3.37 

E 

1 

0.0028 

0.14 

F 

1 

0.0006 

0.03 

Error 

8 

0.0201 
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15.46 With the defining contrasts ABCD, CDEFG, and BDF , we have 

Li = 71 + 72 + 73 + 74, 

L 2 = 73 + 74 + 75 + 76 + 77- 
L 3 = 72 + 74 + 7e- 

The principal block and the remaining 7 blocks are given by 


Block 1 Block 2 


(1), eg 


a, aeg 

abed, bdg 


bed, abdg 

adf, bef 


df, abef 

edef, abedeg 


aedef, bedeg 

bde, adefg 


abde, defg 

bcefg, cdfg 


abcefg, aedfg 

aeg, abef 


eg, bef 

ace, abfg 


ce, bfg 


Block 3 Block 4 


b, beg 


c, ceg 

acd, dg 


abd, bedg 

abdf, cf 


acdf, bf 

bedef, aedeg 


def, abdeg 

de, abdefg 


bcde, aedefg 

cefg, bedfg 


befg, dfg 

abcg, aef 


ag, abcef 

abce, afg 


ae, abefg 


Block 5 Block 6 


d, deg 


e, g 

abc, bg 


abede, bdeg 

af, bedf 


adef beef 

cef, abceg 


cdf abedg 

be, aefg 


bd, adfg 

bedefg, cfg 


befg, edefg 

aedg, abdef 


aceg, abf 

aede, abdfg 


ac, abefg 


Block 7 Block 8 


f, efg 


ab, abeg 

abedf, bdfg 


cd, adg 

ad, be 


bdf acf 

ede, abedefg 


abedef, edeg 

bdef adeg 


ade, bdefg 

bceg, edg 


acefg, abedfg 

acfg, abe 


beg, ef 

acef, abg 


bee, fg 


The two-way interactions AB = CD, AC = BD, AD = BC , BD = F, BF = D and 
F>F = B. 


15.47 A design (where Li = L 2 = L 3 = L 4 = 0 (mod 2) are used) is: 

{(1), abcg , abdh, abef, acdf, aceh, adeg , afgh, 
bcde, bcfh, bdfg, cdgh, cefg, defh, degh, abcdefgh} 


15.48 In the four defining contrasts, BCDE, ACDF, ABCG, and ABDH , the length of 
interactions are all 4. Hence, it must be a resolution IV design. 

15.49 Assuming three factors the design is a 2 3 design with 4 center runs. 

15.50 (a) Consider a 2 design with ABC = I as defining contrast. Then the design 

points are 
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gi X 2 x 3 

-1 -1 -1 

1-1 1 
-111 
1 1-1 
0 0 0 

0 0 0 

For the noncentral design points, x± = X2 = x 3 = 0 and x\ = x\ = x\ = 1 . Hence 

E(y/ — yo) = fto + ( 3 \X\ + @2X2 + ^3X3 + Pnx\ + P22X2 + ^33^3 — Po = P11 + P22 + (^33- 

(b) It is learned that the test for curvature that involves yj — y 0 actually is testing 
the hypothesis ( 3 n + / 3 22 + / 3 33 = 0 . 




Chapter 16 


Nonparametric Statistics 

16.1 The hypotheses 

H 0 : ft, = 20 minutes 
Hi : ft, > 20 minutes. 


a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: Subtracting 20 from each observation and discarding the zeroes. We 
obtain the signs 


- + + - + + - + + + 

for which n — 10 and x = 7. Therefore, the P - value is 

10 

P = P(X > 7 | p = 1/2) = b ( x > 10 > V 2 ) 

x=7 

6 

= 1 - b (x] 10, 1/2) = 1 - 0.8281 = 0.1719 > 0.05. 

Decision: Do not reject H 0 . 

16.2 The hypotheses 


H 0 : [i = 12 
Hi : fi ^ 12 . 


a = 0.02. 

Test statistic: binomial variable X with p — 1/2. 

Computations: Replacing each value above and below 12 by the symbol “+” and ” , 
respectively, and discarding the two values which equal to 12. We obtain the sequence 

- + -- + + + + - + + - + + - + 
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for which n = 16, x = 10 and n / 2 = 8. Therefore, the P-value is 

16 

P = 2 P(X > 10 | p= 1/2) = 2 K x > 16, 1/2) 

*=io 

9 

= 2(1 - ,:> (p 16 > !/2)) = 2(1 - 0.7728) = 0.4544 > 0.02. 

x=0 

Decision: Do not reject H 0 . 

16.3 The hypotheses 

H 0 : p = 2.5 
H i : p ^ 2.5. 

a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: Replacing each value above and below 2.5 by the symbol “+” and ' ”, 

respectively. We obtain the sequence 

for which n = 16, x = 3. Therefore, p — rip — (16) (0.5) = 8 and cr = ^/(16)(0.5)(0.5) = 
2. Hence z = (3.5 — 8)/2 = —2.25, and then 

P = 2 P(X < 3 | p = 1/2) « 2 P(Z < -2.25) = (2)(0.0122) = 0.0244 < 0.05. 

Decision: Reject H 0 . 

16.4 The hypotheses 

Hq ■ P l = P‘2 
Hi : pi < p 2 . 

a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: After replacing each positive difference by a “+” symbol and negative 
difference by a ” symbol, respectively, and discarding the two zero differences, we 
have n = 10 and x = 2. Therefore, the P-value is 

2 

P = P(X <2\p = 1/2) = b (p 10 ’ l / 2 ) = °- 0547 > °- 05 - 

X=§ 

Decision: Do not reject H 0 . 
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16.5 The hypotheses 

#o : Pi ~ P 2 = 4.5 
H l : pi - p 2 < 4.5. 

a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: We have n = 10 and x = 4 plus signs. Therefore, the P-value is 

4 

P = P(X < 4 I p = 1/2) = 10, 1/2) = 0.3770 > 0.05. 

x=0 

Decision: Do not reject Hq. 

16.6 The hypotheses 

Ho : Pa = Pb 
Hi : pA 7 ^ Pb- 

a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: We have n = 14 and x — 12. Therefore, p — np — (14) (1/2) = 7 and 
a = a/ ( 14) (1/2) (1/2) = 1.8708. Hence, z = (11.5 - 7)/1.8708 = 2.41, and then 

P = 2 P(X > 12 | p= 1/2) = 2 P(Z > 2.41) = (2)(0.0080) = 0.0160 < 0.05. 

Decision: Reject H 0 . 

16.7 The hypotheses 

Ho : p2 — pi = 8 
Hy : p 2 — p\ < 8. 

a = 0.05. 

Test statistic: binomial variable X with p — 1/2. 

Computations: We have n = 13 and x — 4. Therefore, p = np = (13) (1/2) = 6.5 and 
a = \J (13) (1/2) (1/2) = 1.803. Hence, z = (4.5 - 6.5)/1.803 = -1.11, and then 

P = P(X > 4 | p= 1/2) = P(Z < -1.11) = 0.1335 > 0.05. 

Decision: Do not reject H 0 . 

16.8 The hypotheses 

Ho : p = 20 
H 1 : p> 20. 

a = 0.05. 

Critical region: w< 11 for n = 10. 

Computations: 
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di 

-3 

12 

5 

-5 

8 

5 

-8 

15 

6 

4 

Rank 

1 

9 

4 

4 

7.5 

4 

7.5 

10 

6 

2 


Therefore, w= 12.5. 

Decision: Do not reject H 0 . 

16.9 The hypotheses 


H 0 : [i = 12 
H\ : p ^ 12. 


a = 0.02. 

Critical region: w<20 for n — 15. 
Computations: 


di 

-3 

1 

-2 -1 

6 

4 

1 

2 -1 

3 

-3 

1 

2 -1 

2 

Rank 

12 

3.5 

8.5 3.5 

15 

14 

3.5 

8.5 3.5 

12 

12 

3.5 

8.5 3.5 

8.5 


Now, w = 43 and w + = 77, so that w — 43. 
Decision: Do not reject H 0 . 

16.10 The hypotheses 


Hq '■ pi — p 2 — 0 
Hi : pi — p 2 < 0. 


a = 0.02. 

Critiral region: ui+ < 1 for n = 5. 
Computations: 


Pair 

1 

2 

3 

4 

5 

di 

-5 

-2 

1 

-4 

2 

Rank 

5 

2.5 

1 

4 

2.5 


Therefore, w + = 3.5. 
Decision: Do not reject H 0 . 

16.11 The hypotheses 


H 0 : pi - p 2 = 4.5 
Hi : pi - p 2 < 4.5. 


a = 0.05. 

Critiral region: w+ <11. 
Computations: 
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Woman 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

di 

-1.5 

5.4 

3.6 

6.9 

5.5 

2.7 

2.3 

3.4 

5.9 

0.7 

d% do 

-6.0 

0.9 

-0.9 

2.4 

1.0 

-1.8 

-2.2 

-1.1 

1.4 

-3.8 

Rank 

10 

1.5 

1.5 

8 

3 

6 

7 

4 

5 

9 


Therefore, w + = 17.5. 

Decision: Do not reject Hq. 

16.12 The hypotheses 

Ho : Pa~ Pb — 0 
H\ : Pa ~ Pb > 0. 

a = 0 . 01 . 

Critiral region: z > 2.575. 

Computations: 


Day 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

di 

2 

6 

3 

5 

8 

-3 

8 

1 

6 

-3 

Rank 

4 

15.5 

7.5 

13 

19.5 

7.5 

19.5 

1.5 

15.5 

7.5 

Day 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

di 

4 

6 

6 

2 

-4 

3 

7 

1 

-2 

4 

Rank 

11 

15.5 

15.5 

4 

11 

7.5 

18 

1.5 

4 

11 


Now w = 180, n = 20, p w+ = (20)(21)/4 = 105, and a w+ = N /(20)(21)(41)/24 = 
26.786. Therefore, z = (180 - 105)/26.786 = 2.80 

Decision: Reject H 0 ; on average, Pharmacy A fills more prescriptions than Pharmacy 
B. 

16.13 The hypotheses 


Ho : pi — P2 — 8 
Hi : pi - p 2 < 8. 


a = 0.05. 

Critiral region: z < —1.645. 
Computations: 


di 

6 

9 

3 

5 

8 

9 

4 

10 

di do 

-2 

1 

-5 

-3 

0 

1 

-4 

2 

Rank 

4.5 

1.5 

10.5 

7.5 

— 

1.5 

9 

4.5 

di 

8 

2 

6 

3 

1 

6 

8 

11 

di do 

0 

-6 

-2 

-5 

-7 

-2 

0 

3 

Rank 

— 

12 

4.5 

10.5 

13 

4.5 

— 

7.5 
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Discarding zero differences, we have w + = 15, n = 13, pw + = (13)(14)/4 = 45,5, and 
<j w+ = \J (13)(14)(27)/24 = 15.309. Therefore, z = ( 15 - 45.5)/14.309 = -2.13 
Decision: Reject H 0 ; the average increase is less than 8 points. 

16.14 The hypotheses 


Ho '■ Pa — Pb — 0 

Hi : pa - Pb ^ 0 . 


a = 0.05. 

Critiral region: w < 21 for n = 14. 
Computations: 



0.09 

0.08 

0.12 

0.06 

0.13 

-0.06 

0.12 

Rank 

7 

5.5 

10 

2.5 

12 

2.5 

10 


0.11 

0.12 

-0.04 

0.08 

0.15 

0.07 

0.14 

Rank 

8 

10 

1 

5.5 

14 

4 

13 


Hence, w + = 101.5, w _ = 3.5, so w — 3.5. 

Decision: Reject H 0 ] the different instruments lead to different results. 

16.15 The hypotheses 


Ho ■ Pb — Pa 
H\ : pb < Pa- 


a = 0.05. 

Critiral region: n i = 3, n -2 = 6 so u\ < 2. 
Computations: 


Original data 

1 7 8 9 10 11 12 13 14 

Rank 

1 2* 3* 4 5* 6 7 8 9 


Now w\ = 10 and hence u\ — 10 — (3)(4)/2 = 4 

Decision: Do not reject H 0 ] the claim that the tar content of brand B cigarettes is 
lower than that of brand A is not statistically supported. 

16.16 The hypotheses 


Ho '■ Pi — P 2 

H\ : pi < p2 • 


a = 0.05. 

Critiral region: u \ < 2. 
Computations: 
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Original data 

0.5 0.9 

1.4 

1.9 

2.1 2.8 

3.1 

4.6 5.3 

Rank 

1 * 2 

3 

4 * 

5 6 * 

7* 

8 9 


Now wi = 18 and hence ui — 18 — (4)(5)/2 = 8 
Decision: Do not reject Hq. 

16.17 The hypotheses 


Ho ■ if a — pB 

H\ : Pa > Pb- 


a = 0 . 01 . 

Critiral region: u 2 < 14. 
Computations: 


Original data 

3.8 

4.0 

4.2 

4.3 

4.5 

4.5 

4.6 

4.8 

4.9 

Rank 

1 * 

2* 

3* 

4 * 

5.5* 

5.5* 

7 

8 * 

9* 

Original Data 

5.0 

5.1 

5.2 

5.3 

5.5 

5.6 

5.8 

6.2 

6.3 

Rank 

10 

11 

12 

13 

14 

15 

16 

17 

18 


Now w 2 = 50 and hence u 2 = 50 — (9) (10) / 2 = 5 
Decision: Reject H 0 ; calculator A operates longer. 

16.18 The hypotheses 


Ho '■ pi — P2 

Hi : pi ^ p 2 • 


a = 0 . 01 . 

Critiral region: u < 27. 
Computations: 


Original data 

8.7 

9.3 

9.5 

9.6 

9.8 

9.8 

9.8 

9.9 

9.9 

10.0 

Rank 

1 * 

2 

3* 

4 

6 * 

6 * 

6 * 

8.5* 

8.5 

10 

Original Data 

10.1 

10.4 

10.5 

10.7 

10.8 

10.9 

11.0 

11.2 

11.5 

11.8 

Rank 

11 * 12 

13* 

14 

15* 

16 

17* 

* 

OO 

t-H 

19 

20 



Here is for process 2. Now w 1 = 111.5 for process 1 and w 2 = 98.5 for process 2. 
Therefore, v,\ = 111.5 — (10) (11) / 2 = 56.5 and u 2 = 98.5 — (10) (11) /2 = 43.5, so that 
u = 43.5. 

Decision: Do not reject H 0 . 

16.19 The hypotheses 


Ho '■ pi — P 2 
H\ : p\ p p 2 . 
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a = 0.05. 

Critiral region: u < 5. 
Computations: 


Original data 

64 

67 

69 

75 

78 

79 

80 

82 87 

88 

91 

93 

Rank 

1 

2 

3* 

4 

5* 

6 

7 * 

8 9* 

10 

11* 

12 


Nowmji = 35 and w 2 = 43. Therefore, Ui = 35— (5)(6)/2 = 20 and u 2 = 43— (7) (8) / 2 = 
15, so that u = 15. 

Decision: Do not reject Hq. 


16.20 The hypotheses 


Hq : Pi — jil‘2 
H\ : jii\ ^ h2- 

a = 0.05. 

Critiral region: Z < —1.96 or z > 1.96. 
Computations: 


Observation 

12.7 

13.2 

13.6 

13.6 

14.1 

14.1 

14.5 

14.8 

15.0 

15.0 

15.4 

Rank 

1* 

2 

3.5* 

3.5 

5.5* 

5.5 

7 

8 

9.5* 

9.5 

11.5* 

Observation 

15.4 

15.6 

15.9 

15.9 

16.3 

16.3 

16.3 

16.3 

16.5 

16.8 

17.2 

Rank 

11.5 

13* 

14.5* 

14.5 

17.5* 

17.5* 

17.5 

17.5 

20 

21* 

22 

Observation 

17.4 

17.7 

17.7 

18.1 

18.1 

18.3 

18.6 

18.6 

18.6 

19.1 

20.0 

Rank 

23 

24.5 

24.5* 

26.5* 

26.5 

28 

30 

30* 

30 

32 

33 


Now w i = 181.5 and v,\ = 181.5 — (12) (13) /2 = 103.5. Then with pu 1 = (21) (12) /2 = 
126 and a Vl = N /(21)(12)(34)/12 = 26.721, we find z = (103.5 - 126)/26.721 = -0.84. 
Decision: Do not reject Hq. 

16.21 The hypotheses 

Hq : Operating times for all three calculators are equal. 

H\ : Operating times are not all equal. 

a = 0 . 01 . 

Critiral region: h > Xooi = 9.210 with v — 2 degrees of freedom. 

Computations: 
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Ranks for Calculators 

A 

B 

C 

4 

8.5 

15 

12 

7 

18 

1 

13 

10 

2 

11 

16 

6 

8.5 

14 

ri = 25 

5 

17 


3 

r 3 = 90 

r 2 = 56 


12 


25 2 | 56 2 | 90 2 

5 7 ~ r 6 


(3)(19) = 10.47. 


Now h = mm 

Decision: Reject H 0 ; the operating times for all three calculators are not equal. 


16.22 Kruskal- Wallis test (Chi-square approximation) 

- (3)(33) = 20.21. 

Xo.o 5 = 5.991 with 2 degrees of freedom. So, we reject H 0 and claim that the mean 
sorptions are not the same for all three solvents. 


h = 


12 


(32) (33) 9 


210. 5 2 189 2 128. 5 J 

+ + 


15 


16.23 The hypotheses 


H 0 : Sample is random. 

Hi : Sample is not random. 


a = 0 . 1 . 

Test statistics: V, the total number of runs. 

Computations: for the given sequence we obtain ri\ = 5, n 2 = 10, and v = 7. Therefore, 
from Table A. 18, the P - value is 

P = 2 P(V < 7 when H 0 is true) = (2) (0.455) = 0.910 >0.1 

Decision: Do not reject H 0 ] the sample is random. 

16.24 The hypotheses 


H 0 : Fluctuations are random. 

Hi : Fluctuations are not random. 


a = 0.05. 

Test statistics: V, the total number of runs. 

Computations: for the given sequence we find x = 0.021. Replacing each measurement 




266 


Chapter 16 Nonpar ametric Statistics 


by the symbol “+” if it falls above 0.021 and by the symbol ” if it falls below 0.021 
and omitting the two measurements that equal 0.021, we obtain the sequence 

----- + + + + + 
for which rq = 5, n 2 = 5, and v = 2. Therefore, the P-value is 

P = 2 P(V < 2 when H 0 is true) = (2) (0.008) = 0.016 < 0.05 
Decision: Reject Hq ; the fluctuations are not random. 

16.25 The hypotheses 


Ho : pa — hB 
Hi : Pa > /he- 


a = 0 . 01 . 

Test statistics: V, the total number of runs. 

Computations: from Exercise 16.17 we can write the sequence 

BBBBBBABBAABAAAAAA 

for which n \ = 9, rq = 9, and v = 6. Therefore, the P-value is 

P = P(V < 6 when H 0 is true) = 0.044 > 0.01 

Decision: Do not reject Ho. 

16.26 The hypotheses 


H 0 : Defectives occur at random. 

H 1 : Defectives do not occur at random. 


a = 0.05. 

Critical region: z < —1.96 or z > 1.96. 

Computations: n \ = 11, n 2 = 17, and v = 13. Therefore, 


Pv — 


(2) (11) (17) 


1 = 14.357, 


= 


(2)(11)(17)[(2)(11)(17) — 11 — 17] 
(28 2 )(27) 


6.113, 


and hence cry = 2.472. Finally, 


z = ( 13 - 14.357)/2.472 = -0.55. 


Decision: Do not reject H 0 . 
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16.27 The hypotheses 


Hq : Sample is random. 

Hi : Sample is not random. 

a = 0.05. 

Critical region: z < —1.96 or z > 1.96. 

Computations: we find x = 2.15. Assigning “+” and ” signs for observations above 
and below the median, respectively, we obtain n \ = 15, n-i = 15, and v = 19. Hence, 


Pv — 



mam +1=16 , 

30 

(2) (15) (15) [(2) (15) (15) — 15 — 15] 
(30 2 )(29) 


7.241, 


which yields Gy = 2.691. Therefore, 


z = (19 — 16)/2.691 = 1.11. 


Decision: Do not reject H 0 . 

16.28 1 - 7 = 0.95, 1 - a = 0.85. From Table A.20, n = 30. 

16.29 n — 24, 1 — a = 0.90. From Table A.20, 1 — 7 = 0.70. 

16.30 1 — 7 = 0.99, 1 — a = 0.80. From Table A. 21, n = 21. 

16.31 n = 135, 1 - a = 0.95. From Table A.21, 1 - 7 = 0.995. 

16.32 (a) Using the computations, we have 


Student 

Test 

Exam 

di 

L.S.A. 

4 

4 

0 

W.P.B. 

10 

2 

8 

R.W.K. 

7 

8 

-1 

J.R.L. 

2 

3 

-1 

J.K.L. 

5 

6.5 

-1.5 

D.L.P. 

9 

6.5 

2.5 

B.L.P. 

3 

10 

-7 

D.W.M. 

1 

1 

0 

M.N.M. 

8 

9 

-1 

R.H.S. 

6 

5 

-1 


_ J6K12A5L 

( 10 )( 100 - 1 ) 


0.24. 
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(b) The hypotheses 


H 0 : p = 0 

Hi : p > 0 


a = 0.025. 

Critical region: rg > 0.648. 
Decision: Do not reject Hq. 

16.33 (a) Using the following 


Ranks 

d 

Ranks 

d 

X 

y 

X 

y 

1 

6 

-5 

14 

12 

2 

2 

1 

1 

15 

2 

13 

3 

16 

-13 

16 

6 

10 

4 

9.5 

-5.5 

17 

13.5 

3.5 

5 

18.5 

-13.5 

18 

13.5 

4.5 

6 

23 

-17 

19 

16 

3 

7 

8 

-1 

20 

23 

-3 

8 

3 

5 

21 

23 

-2 

9 

9.5 

-0.5 

22 

23 

-1 

10 

16 

-6 

23 

18.5 

4.5 

11 

4 

7 

24 

23 

1 

12 

20 

-8 

25 

6 

19 

13 

11 

2 





we obtain rs — 1 — ( 25 )(G 25 -i) = 0-39. 
(b) The hypotheses 


H o : p = 0 


a = 0.05. 

Critical region: rg < —0.400 or r s > 0.400. 
Decision: Do not reject Hq. 

16.34 The numbers come up as follows 


Ranks 

d 

Ranks 

d 

Ranks 

d 

X 

y 

X 

y 

X 

y 

3 

7 

-4 

4 

6 

-2 

7 

3 

4 

6 

4.5 

1.5 

8 

2 

6 

5 

4.5 

0.5 

2 

8 

-6 

1 

9 

-8 

9 

1 

8 
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Y d 2 = 238.5, r,s 


(6)(238.5) 

(9)(80) 


-0.99. 


16.35 (a) We have the following table: 


Weight 

Chest Size 

d% 

Weight 

Chest Size 

di 

Weight 

Chest Size 

di 

3 

6 

-3 

1 

1 

0 

8 

8 

0 

9 

9 

0 

4 

2 

2 

7 

3 

4 

2 

4 

-2 

6 

7 

-1 

5 

5 

0 


(b) The hypotheses 


r,s 


(6) (34) 
(9) (80) 


0.72. 


H 0 : p = 0 
H\ : p > 0 


a = 0.025. 

Critical region: rs > 0.683. 

Decision: Reject H 0 and claim p > 0. 


16.36 The hypotheses 


H 0 : p = 0 
Hr : p ± 0 


a = 0.05. 

Critical region: r s < —0.683 or rg > 0.683. 
Computations: 


Manufacture 

A 

B 

C 

D 

E 

F 

G 

H 

I 

Panel rating 

6 

9 

2 

8 

5 

1 

7 

4 

— 

Price rank 

5 

1 

9 

8 

6 

7 

2 

4 

3 

d{ 

1 

8 

-7 

0 

-1 

-6 

5 

0 

0 


Therefore, r s = 1 - = -0.47. 

Decision: Do not reject Hq. 

16.37 (a) £ d 2 = 24, r s = 1 - = 0.71. 

(b) The hypotheses 


H 0 : p = 0 

Hi : p > 0 
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16.38 


16.39 


16.40 


a = 0.05. 

Critical region: r\g > 0.643. 

Computations: r\g = 0.71. 

Decision: Reject H 0 , p > 0. 

(a) £ d 2 = 1828, r s = 1 — = 0.59. 

(b) The hypotheses 

H 0 : p = 0 

a = 0.05. 

Critical region: r\g < —0.364 or r s > 0.364. 

Computations: r\g = 0.59. 

Decision: Reject H 0 , p ^ 0. 

(a) The hypotheses 

Ho '■ Pa = hB 
Hi : pa ^ Rs 

Test statistic: binomial variable X with p — 1/2. 

Computations: n = 9, omitting the identical pair, so x = 3 and P-value is 
P = P(X < 3) = 0.2539. 

Decision: Do not reject H 0 . 

(b) w + = 15.5, n = 9. 

Decision: Do not reject Hq. 

The hypotheses: 

Ho '■ Pi = P 2 = R3 = Rr- 

Hi : At least two of the means are not equal. 

a = 0.05. 

Critical region: h > Xom = 7.815 with 3 degrees of freedom. 

Computaions: 


Ranks for the Laboratories 

A 

B 

C 

D 

7 

18 

2 

12 

15.5 

20 

3 

10.5 

13.5 

19 

4 

13.5 

8 

9 

1 

15.5 

6 

10.5 

5 

17 

ri = 50 

r 2 = 76.5 

CO 

Oi 

r 4 = 68.5 
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Hq : /i-29 — h-54 ~ h 84- 

i?! : At least two of the means are not equal. 
Kruskal- Wallis test (Chi-squared approximation) 


12 T6 2 38 2 34 2 ’ 

7 ““ (12)(13) [3 + T" + ~T_ 


(3)(13) = 6.37, 


with 2 degrees of freedom. Xom = 5.991. 

Decision: reject Hq. Mean nitrogen loss is different for different levels of dietary protein. 




Chapter IT 

Statistical Quality Control 


17.1 Let Y = X\ + X 2 + • • • + X n . The moment generating function of a Poisson random 
variable is given by Mx(t ) = e^ e _1 d By Theorem 7.10, 

M Y (t ) = • e^ et ~^ ■ ■ ■ 

which we recognize as the moment generating function of a Poisson random variable 

n 

with mean and variance given by ^ fii- 

i = 1 


17.2 The charts are shown as follows. 



0.015 i UCL 



Sample 


Sample 


Although none of the points in i?-chart is outside of the limits, there are many values 
fall outside control limits in the A"-chart. 

17.3 There are 10 values, out of 20, fall outside the specification ranges. So, 50% of the 
units produced by this process will not confirm the specifications. 

17.4 X = 2.4037 and a = | = ^ff|p = 0.00298. 

17.5 Combining all 35 data values, we have 


x = 1508.491, R = 11.057, 
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so for X-chart, LCL = 1508.491 - (0.577) (11.057) = 1502.111, and UCL = 1514.871; 
and for i?-chart, LCL = (11.057) (0) = 0, and UCL = (11.057) (2.114) = 23.374. Both 
charts are given below. 




The process appears to be out of control. 

17.6 

P = P(Z < 3 - 1.5\/5) - P(Z < -3 - 1.5 V5) 

= P(Z < -0.35) - P(Z < -6.35) « 0.3632. 

So, 

E(S) = 1/(1 - 0.3632) = 1.57, and a s = sfp(l - P ) 2 = 0.896. 

17.7 From Example 17.2, it is known than 

LCL = 62.2740, and UCL = 62.3771, 

for the X- chart and 

LCL = 0, and LICL = 0.0754, 
for the S'-chart. The charts are given below. 
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The process appears to be out of control. 

17.8 Based on the data, we obtain p = 0.049, LCL = 0.049 — 3 ^ a049 b a9j b = —0.043, and 

LCL = 0.049 + 3 ^°' 049 ^°- 951) = 0.1406. Based on the chart shown below, it appears 
that the process is in control. 



17.9 The chart is given below. 



Although there are a few points closed to the upper limit, the process appears to be 
in control as well. 


17.10 We use the Poisson distribution. The estimate of the parameter A is A = 2.4. So, the 
control limits are LCL = 2.4 — 3^2.4 = —2.25 and LICL = 2.4 + 3^2.4 = 7.048. The 
control chart is shown below. 




Chapter 18 

Bayesian Statistics 


18.1 For p = 0.1, 6( 2; 2, 0.1) = g) (0.1) 2 = 0.01. 

For p = 0.2, 6(2; 2, 0.2) = g) (0.2) 2 = 0.04. Denote by 

A : number of defectives in our sample is 2; 

Bi : proportion of defective is p — 0.1; 

B 2 : proportion of defective is p = 0.2. 

Then 

PiB^A) = (0 ' 6)(0 ' 01) = 0.27, 

V 1 ; (0.6)(0.01) + (0.4)(0.04) 

and then by subtraction P(B 2 \A ) = 1 — 0.27 = 0.73. Therefore, the posterior distribu- 
tion of p after observing A is 

p 0.1 0.2 

n(p\x = 2) 0.27 0.73 


for which we get p* = (0.1)(0.27) + (0.2)(0.73) = 0.173. 

18.2 (a) For p = 0.05, 6(2; 9, 0.05) = © (0.05) 2 (0.95) 7 = 0.0629. 

For p = 0.10, 6(2; 9, 0.10) = g) (0.10) 2 (0.90) 7 = 0.1722. 

For p = 0.15, 6(2; 9, 0.15) = g) (0.15) 2 (0.85) 7 = 0.2597. 

Denote the following events: 

A : 2 drinks overflow; 

B\ : proportion of drinks overflowing is p = 0.05; 

B 2 : proportion of drinks overflowing is p — 0.10; 

i ?3 : proportion of drinks overflowing is p — 0.15. 


Then 

P( n U) = (°- 3 )(°- 0629 ) 

v 1 ’ (0.3)(0.0629) + (0.5)(0.1722) + (0.2)(0.2597) 

P(B = (0.5)(0.1722) 

1 21 ; (0.3)(0.0629) + (0.5)(0.1722) + (0.2)(0.2597) 


0 . 12 , 

0.55, 
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and P{Bz\A) = 1 — 0.12 — 0.55 = 0.33. Hence the posterior distribution is 


p 

0.05 

0.10 

0.15 

7 j(p\x = 2) 

0.12 

0.55 

0.33 


(b) p* = (0.05)(0.12) + (0.10)(0.55) + (0.15)(0.33) = 0.111. 
18.3 (a) Let A" = the number of drinks that overflow. Then 


/0|p) = b(x;4,p) = ( )p x (l-p ) 

, oc 


4—x 


for x — 0, 1, 2, 3, 4. 


Since 


/(i,p) = /ObOO) 


10 


p(l -p) ; 


40p(l - p ) 3 , for 0.05 < p < 0.15, 


then 


<■ 0.15 


fi/l) — 40 / p(l - pf dp = -2(1 -p) 4 (4p+ 1)|J;^ = 0.2844, 


' 0.05 


and 


7r(p0 = 1) = 40p(l — p) 3 /0.2844. 

(b) The Bayes estimator 

40 O' 15 


P = 


0-2844 J om 
40 

(0.2844) (60) 


p 2 (l — py dp 

p 3 (20 - 45 p + 36p 2 - !Op 3 )|°o5 = 0.106. 


18.4 Denote by 

A : 12 condominiums sold are units; 

B\ : proportion of two-bedroom condominiums sold 0.60; 

B -2 : proportion of two-bedroom condominiums sold 0.70. 


For p = 0.6, 6(12; 15,0.6) = 0.0634 and for p = 0.7, 6(12; 15,0.7) = 0.1701. The prior 
distribution is given by 


P 

0.6 

0.7 

n(p) 

1/3 

2/3 


So, P(B t \A) = IV3 )(..MS ) w77)ki7»i) = °' 157 “ d PWA) = 1 - 0.157 = 0.843. 
Therefore, the posterior distribution is 
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p 

0.6 

0.7 

7 r(p\x = 12) 

0.157 

0.843 


(b) The Bayes estimator is p* = (0.6)(0.157) + (0.7)(0.843) = 0.614 
18.5 n — 10, x — 9, cr = 0.8, po = 8, ao = 0.2, and Z0.025 = 1.96. So, 
(10)(9)(0.04) + (8)(0.64) 


Pi — 


(10)(0.04) + 0.64 


= 8.3846, < 7 1 = 


(0.04) (0.64) 
(10) (0.04) + 0.64 


= 0.1569. 


To calculate Bayes interval, we use 8.3846 ± (1.96) (0.1569) = 8.3846 ± 0.3075 which 
yields (8.0771,8.6921). Hence, the probability that the population mean is between 
8.0771 and 8.6921 is 95%. 

18.6 n = 30, x = 24.90, s = 2.10, po = 30 and <+) = 1.75. 

( a ) ^ = = 25 1336 

(b) or* = \J .Jjf+o-2 = \J l i 6 5 2°85 6 = °- 3745 > an<4 +).025 = 1-96. Hence, the 95% Bayes 
interval is calculated by 25.13 + (1.96)(0.3745) which yields $23.40 < p < $25.86. 

(c) P(24 < p < 26) = P (^ytf < z < ^y|P) = P(— 3.02 < Z < 2.32) = 
0.9898 - 0.0013 = 0.9885. 


18.7 (a) P(71.8 < p < 73.4) = P ( 


71.8-72 

+5/76 


< Z < 


0.2509 

(b) n = 100, x = 70, s 2 = 64, p 0 = 72 and Ug = 5.76. Hence, 

(100) (70) (5.76) + (72) (64) 


++ ) = ‘PI - °-08 < Z < 0.58) = 


Pi — 


cr i = 


(100)(5.76) + 64 

= 0.759. 


= 70.2, 


(5.76) (64) 


(100)(5.76) + 64 


Hence, the 95% Bayes interval can be calculated as 70.2 ± (1.96) (0.759) which 
yields 68.71 < p < 71.69. 

(c) P(71.8 < p < 73.4) = P ( 71 0 8 ~ 7 9 a2 < Z < 73 0 4 - 70 - 2 ) = P(2.11 < Z < 4.22) = 
0.0174. 


18.8 Multiplying the likelihood function 
f(x i,x 2 , • • .,x n \ p) = 


(2vr) 25 / 2 100 25 


exp 


by the prior n(p) = 75 for 770 < p < 830, we obtain 


f (*M 5 rC‘i‘ ) • • • j rc n 1 P ) 


(60)(2vr) 25 / 2 1 00 25 


exp 


25 




1=1 


25 

E 

i = 1 


Xi p 

100 


Xi 


p 


100 


1 / ££—780 \ 2 


= Ke 2 \ 20 
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where K is a function of the sample values. Since the marginal distribution 

/■830 


g(x 1 , X2,..., x n ) = v / 27 t (20 )/i 


1 


Lv^o 


1 ( m-780 \ 2 

e h ioo ) dfi 


'770 


= +2+13.706 )K. 


Hence, the posterior distribution 

f(x i,x 2 , . . .,x n ) 


n(p\x 1 ,x 2 , . . < } x n ) = 


g(xi,x 2 ,...,x n ) +2+13.706) 


1 ( /a— 780 \ 2 

e 2 l 20 1 


for 770 < p < 830. 

18.9 Multiplying the likelihood function and the prior distribution together, we get the joint 
density function of 6 as 


f(ti,t 2 , ,t n ,0) = 2d” exp 


-6 ( t{ + 2 


, 2=1 


, for 6 > 0. 


Then the marginal distribution of (Tj, T 2 , , T n ) is 


g{ti,t 2 , . . . ,t n ) = 2 / 6> n exp 

do 


2T(n + 1) 


-9 1 y ^ ti + 2 


. i=l 


d”exp 


d9 


—9 ( ti + 2 
. 2=1 


yi u + 2 
2=1 

2T(n + 1) 


n+1 


n \ -(n+1) 

T( n + 1) ( Ylu + 2 
0=1 




X] ti + 2 

2=1 


n+1 ’ 


since the integrand in the last term constitutes a gamma density function with pa- 
rameters a — n + 1 and (5=1/ U + 2^j . Hence, the posterior distribution of 9 

is 


+# + , ...,t n ) = 


yi tj + 2 

2=1 


n+1 


/(fi, ...,t n ,9) WB 

g{ti,---,t n ) r(n + i) 


-d n exp 


■ 9 I ti + 2 


, i=l 


for d > 0, which is a gamma distribution with parameters a = n + 1 and (5 = 

Mg*, + 2 ). 
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18.10 Assume that p(xi |A) = e 


-A A x i 
. 1 1 


Xi — 0, 1, , for i — 1, 2, , n and 7 t(A) = ^A 2 e A//2 , 


for A > 0. The posterior distribution of A is calculated as 


n 

e -(ra+l/2)A^i?i Xl+2 

7r(A|a?i, . . . , x n ) = 

/ 0 °° e -C-+i/2) x\£i Xi+2 dX 


(■ n + l/2) nS+3 
T (nx + 3) 


A.S a ’ ,+2 e -(»+i/2)A i 


which is a gamma distribution with parameters a — nx + 3 and (3 = (n + 1/2) 1 , 
with mean ^+3 ■ Hence, plug the data in we obtain the Bayes estimator of A, under 
squared-error loss, is A* = 1 q^ i 3 2 = 5.7143. 

18.11 The likelihood function of p is ( x ~ 1 )p 5 (l — p) x ~ 5 and the prior distribution is n (p) = 1. 
Hence the posterior distribution of p is 


7T ip\x) 


p 5 (l-p) x ~ 5 _ T(x + 2) 5 

foP 5 (l — p) x ~ 5 dp T(6)T(a; — 4)^ 


which is a Beta distribution with parameters a = 6 and f3 — x — 4. Hence the Bayes 
estimator, under the squared-error loss, is p* = 



